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Abstract 

We introduce a square root map on Sturmian words and study its properties. Given a Sturmian 
word of slope a, there exists exactly six minimal squares in its language (a minimal square does 
not have a square as a proper prefix). A Sturmian word s of slope a can be wriffen as a producf 
of these six minimal squares: s = XjX^Xg • • •. The square root of s is defined to be the word 
= X1X2X3 ■ ■ ■. The main result of this paper is that that ^/s is also a Sturmian word of slope 
a. Further, we characterize the Sturmian fixed points of the square root map, and we describe 
how to find the intercept of ^/s and an occurrence of any prefix of ^/s in s. Related to the square 
root map, we characterize the solutions of the word equation XjX| ■ ■ ■ = (X1X2 ■ ■ ■ X„)^ in 

the language of Sturmian words of slope a where the words Xj are minimal squares of slope a. 

We also study the square root map in a more general setting. We explicitly construct an 
infinite set of non-Sturmian fixed points of the square root map. We show that the subshifts 
Q generated by these words have a curious property: for all a; 6 O either -^/w 6 Q or ^/^v is 
periodic. In particular, the square root map can map an aperiodic word to a periodic word. 

Keywords-, sturmian word, standard word, optimal squareful word, word equation, continued fraction 


1 Introduction 

Kalle Saari studies in [16,17] optimal squareful words which are aperiodic words containing the 
least number of minimal squares (fhaf is, squares wifh no proper square prefixes) such fhaf every 
posifion sfarfs a square. Saari proves fhaf an optimal squareful word always confains exacfly 
six minimal squares, and he characferizes fhese squares; less fhan six minimal squares forces a 
word fo be ulfimafely periodic. Moreover, he shows fhaf Sfurmian words are a proper subclass 
of opfimal squareful words. 

We propose a square roof map for Sfurmian words. Lef s be a Sfurmian word of slope a, 
and wrife if as a producf of fhe six minimal squares in ifs language £(a): s = X^X^Xg ■ ■ ■. The 
square roof of s is defined fo be fhe word ^/s = X 1 X 2 X 3 ■ ■ ■. The main resulf of fhis paper is 
fhaf fhe word ^ is also a Sfurmian word of slope a. More precisely, we prove fhat fhe square 
roof of fhe Sfurmian word Sx,a of intercepf x and slope cc is s^p(^x),a where ip{x) = j{x + 1 — a). 
In addition fo proving fhaf fhe square roof map preserves fhe language of a Sfurmian word s, 
we show how fo locafe any prefix of -y/s in s. We also characferize fhe Sfurmian words of slope 
a which are fixed poinfs of fhe square roof map; fhey are fhe two Sfurmian words 01ca and 
10c« where c« is fhe infinife sfandard Sfurmian word of slope a. The majorify of fhe proofs 
of resulfs on Sfurmian words rely heavily on fhe inferprefafion of Sfurmian words as rofafion 
words. Continued fracfions and resulfs from Diophanfine approximafion fheory play a key role 
in several proofs. 
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Solutions of the word equation Xj ■ ■ ■ = (Xj ■ ■ ■ X„)^ where the words X? are among the 

six minimal squares in £(a) for some fixed irrational oc are closely linked to the square root map. 
The study of these solutions to this word equation arises naturally from the study of fixed points 
of the square root map. The Sturmian fixed points of the square root map are fixed because they 
have arbitrarily long prefixes Xf ■ ■ ■ which satisfy the word equation. We characterize these 
specific solutions, i.e., those primitive words zv such that zv^ e >C(a) and zv^ can be written as 
a product of minimal squares Xf ■ ■ ■ X^ satisfying the word equation. On the circle [0,1), the 
interval [if] of such a word zv can be seen to satisfy the square root condition !/’([if^]) C [zv], so 
we instead study and characterize the primitive words satisfying this square root condition. The 
result is that the specific solutions to the word equation (or, equivalently, the primitive words 
satisfying the square root condition) are the reversals of standard and semistandard words of 
slope a (see Subsection 2.3 for a definition) and the reversed standard words with the first two 
letters exchanged. In particular, all of these specific solutions are nonperiodic. It was known that 
the word equation (X^ ■ ■ ■ X^) = Xj ■ ■ ■ X^ has nonperiodic solutions [7], but according to our 
knowledge no large families of nonperiodic solutions have been identified until our result. Word 
equations of the t 5 ipe Xj ■ ■ ■ X^ = [X^ - ■ ■ X„)^ have been considered by Stepan Holub [6, 7, 8]. 

The final central topic of this paper concerns the square root map in a more general setting. 
The square root map can be defined not only for Sturmian words but for any optimal square¬ 
ful word. We construct an infinite family of non-Sturmian, linearly recurrent optimal square¬ 
ful words T with properties similar to Sturmian words. The words T are fixed points of the 
square root map. They are constructed by finding non-Sturmian solutions of the word equation 
Xj ■ ■ ■ X^ = (Xi ■ ■ ■ X,i)^ and by building infinite words having arbitrarily long squares of such 
solutions as prefixes. The subshifts O generated by the words T exhibit behavior similar to Stur¬ 
mian subshifts. The square root map preserves the language of several but not every word in 
n. Curiously, if the language of a word in O is not preserved under the square root map, then 
the image must be periodic. This result is very surprising since it is contrary to the plausible 
hypothesis that the square root of an aperiodic word is aperiodic. 

The paper is organized as follows. In Section 3 we prove that the square root map preserves 
the language of a Sturmian word. As a corollary we obtain a description of those Sturmian 
words which are fixed points of the square root map. In Section 3 we observe that the intervals 
of the minimal squares in £(a) satisfy the square root condition. In Section 4 we characterize all 
words zv^ € £(a) satisfying the square root condition. The result is that zv^ with zv primitive 
satisfies the square root condition if and only if if is a reversed standard or semistandard word 
or a reversed standard word with the first two letters exchanged. Section 5 contains a proof 
of the characterization of the specific solutions of the word equation X^ ■ ■ ■ X^ = (X^ ■ ■ ■ X„)^ 
mentioned earlier. We show that a primitive word zv satisfies the square root condition if and only 
if zv^ can be written as a product of minimal squares satisfying the word equation. In Section 6 we 
show how to locate prefixes of -y/s in s. As an important step in proving this, we provide necessary 
and sufficient conditions for a Sturmian word to be a product of squares of reversed standard 
and semistandard words. We give a formula describing the square root of the Fibonacci word 
in Section 7. Section 8 is devoted to constructing the non-Sturmian fixed points T mentioned 
above and to demonstrating that the languages of the words in their subshifts are preserved or 
they are mapped to periodic words. We conclude the paper by giving some remarks on possible 
generalizations in Section 9 and by discussing a few open problems in Section 10. 

A short version of this paper was published as an extended abstract in the proceedings of 
WORDS 2015 [13]. 

2 Notation and Preliminary Results 

In this section we review notation and basic concepts and results of word combinatorics, optimal 
squareful words, continued fractions, and Sturmian words. Most of the definitions and results 
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provided here about words can be found in Lothaire's book [11], 

An alphabet A is a finite non-empty set of letters, or symbols. A (finite) ivord over A is a finite se¬ 
quence of letters of A obtained by concatenation. The concatenation of two words u = aQ ■ ■ ■ a„_i 
and V = bo ■ ■ ■ is the word u ■ v = uv = a^ ■ ■ ■ a„_ibo ■ ■ ■ bm-i- In this paper we consider only 
binary words, that is, words over an alphabet of size two. Most of the time we take A to be the set 
{0,1}. The set of nonempty words over A is denoted by A~^. We denote the empty word by e and set 
A* = A+ U {e}. A nonempty subset of A* is called a language. Let w = a^a^ ■ ■ ■ fl„_i be a word of 
n letters. We denote the length n of why 1^1; by convention |e| =0. The set of proper powers of a 
word w is denoted by w^. 

An infinite word w over the alphabet A is a function from the normegative integers to A. We 
write concisely w = ■ ■ ■ with a, e A. The set of infinite words over A is denoted by A^. An 

infinite word w is said to be ultimately periodic if we can write it in the form w = uv^ = uvvv ■ ■ ■ 
for some words u,v ^ A*. If u = e, then w is said to be periodic, or purely periodic. An infinite 
word which is not ultimately periodic is aperiodic. The shift operator T acts on infinite words as 
follows: r(flofllfl 2 ■ • ■) = ^ 1^2 ■ ■ ■ ■ 

A finite word m is a factor of the finite or infinite word w if we can write w = vuz for some 
u e A* and z e A* U A“. If v = e, then the factor u is called a prefix of a;. If z = e, then we say that 
u is a suffix of IV. The set of factors of w, the language ofw, is denoted by L{w). Ifw = 
then we let zv[i,i] = a^ - ■ ■ aj whenever the choices of positions i and j make sense. This notion 
is extended to infinite words in a natural way. An occurrence of u in rc is a position i such that 
w[i,i + \u\ — 1] = u. If such a position exists, then we say that u occurs in w. 

A positive integer p is a period of zf = uq - ■ ■ a^-i if a, = fl,+p for 0 < i < n — p — 1. If the 
finite word w has period p and \w\/p > a for some real a such that a > 1, then zv is called an 
oi-repetition. An a-repetition is minimal if it does not have an a-repetition as a proper prefix. If 
w = u^, then zv is a square with square root u. A square is minimal if it does not have a square as 
a proper prefix. A word w is primitive if it is of the form z” if and only if n = 1. Equivalently, 
a word w is primitive if and only if w occurs in zv^ exactly twice. The primitive root of w is the 
unique primitive word u such that w = u’^ for some n > 1. Let w = v^ he a periodic infinite 
word. The minimal period ofzv is defined to be the primitive root of v. 

Let w = flo^i ■ ■ ■ (in-i be a word. The reversal tu of ic is the word fl„_i ■ ■ ■ a^aQ. If w = w, then 
we call w a palindrome. Let C be the cyclic shift operator defined by the formula C(flofli ■ ■ ■ ^n-i) = 
fli ■ ■ ■ The words w, C(w), C^(zv),..., C''^^~^{w) are the conjugates ofw. If m is a conjugate 

of w, then we say that u is conjugate to w. 

An infinite word zv is recurrent if each of its factors occurs in it infinitely often. Let {in)n>i be 
the sequence of consecutive occurrences of a factor u in a recurrent word w. The return time of u 
is the quantity 

sup{q+i-q: j e { 1 , 2 ,...}}, 

which can be infinite. The factors zv[ij, q+i — 1],; > 1 are the returns to u in w. If the return time 
of each factor of w is finite, then the word w is uniformly recurrent. Equivalently, w is uniformly 
recurrent if for each factor u of w there exists an integer R such that every factor of w of length 
R contains an occurrence of M. If there exists a global constant K such that the return time of any 
factor M of w is at most K\u\, then we say that w is linearly recurrent. Clearly a linearly recurrent 
word is uniformly recurrent. The index of a factor u of an infinite word w is defined to be 

supju: u” e £{w)}. 

If w is uniformly recurrent and aperiodic, then the index of every factor of w is finite. 

A subshift n is a subset of A^ such that 

n = {weA^-. £{w) C £} 
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for some language £ such that £ C A*. If we set above £ = £(w) where w is an infinite word, 
then we say that the subshift O is generated by w. Subshifts are clearly shift-invariant. If every 
word in a subshift is aperiodic, then we call the subshift aperiodic. A subshift is minimal if it does 
not contain nonempty subshifts as proper subsets. A nonempty subshift is minimal if and only if 
it is generated by a uniformly recurrent word. 

2.1 Optimal Squareful Words 

In [17] Kalle Saari considers a-repetitive words. An infinite word is a-repetitive if every position 
in the word starts an a-repetition and the number of distinct minimal a-repetitions occurring in 
the word is finite. If a = 2, then a-repetitive words are called squareful words. This means that 
every position of a squareful word begins with a minimal square. Saari proves that if the number 
of distinct minimal squares occurring in a squareful word is at most 5, then the word must be 
ultimately periodic. On the other hand, if a squareful word contains at least 6 distinct minimal 
squares, then aperiodicity is possible. Saari calls the aperiodic squareful words containing exactly 
6 minimal squares optimal squareful words. Further, he shows that optimal squareful words are 
always binary and that the six minimal squares must take a very specific form: 

Proposition 2.1. Let w be an optimal squareful word. Jf lO'l occurs in wfor some i > 1, then the roots 
of the six minimal squares in w are 

51 =0, S4 = 10“, 

5 2 = 010“”\ S 5 = 10“+^(10“)^ (1) 

53 = 010“, Se = 10“+^(10“)‘’+\ 

for some a > 1 and b > 0 . 

The optimal squareful words containing the minimal square roots of (1) are called optimal 
squareful words with parameters a and b. For the rest of this paper we reserve this meaning for the 
symbols a and b. Furthermore, we agree that the symbols S; always refer to the minimal square 
roots ( 1 ). 

Saari completely characterizes optimal squareful words [17, Theorem 17]. 

Proposition 2.2. An aperiodic infinite word w is optimal squareful if and only if (up to renaming of 
letters) there exists integers a > 1 and b > 0 such that w is an element of the language 

0*(10“)*(10“+^(10“)‘’ -b 10“+^(10“)‘’+^)“’ = S['S4(S5 -b 56)“". 


2.2 Continued Fractions and Rational Approximations 

In this section we review results on continued fractions and best rational approximations of irra¬ 
tional numbers needed in this paper. Good references on these subjects are the books of Khinchin 
[9] and Cassels [2]. 

Every irrational real number cc has a unique infinite continued fraction expansion 


a = [flo; ^1/ ‘^ 2 , fl 3 / • • •] = flo + 


fll 


fl2 -b 


03 -b ■ 


( 2 ) 


with aQ G Z and aj^ g'N for all A: > 1. The numbers a, are called the partial quotients of oc. We focus 
here only on irrational numbers, but we note that with small tweaks much of what follows also 
holds for rational numbers, which have finite continued fraction expansions. 
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The convergents Cj^ = ^ of a are defined by fhe recurrences 
Po = flO/ Pi = fliflo + 1/ Pk = akVk-1 + Vk-ir k>2, 

% = 1/ Cjl = fll/ = ‘^k‘^k-1 + ‘?/c-2/ fc > 2. 

The sequence {c]()k>o converges fo cc. Moreover, the even convergents are less than a and form 
an increasing sequence and, on the other hand, the odd convergents are greater than a and form 
a decreasing sequence. 

If it > 2 and > 1, then between the convergents Cjt _2 and ci^ there are semiconvergents (called 
intermediate fractions in Khinchin's book [9]) which are of the form 

Pk,l ^ (-Pk-i + Pk-i 

^k,e (■%-! + ‘^k-2 

with 1 < £ < When the semiconvergents (if any) between cjt-i and cjt are ordered by the size 
of their denominators, the sequence obtained is increasing if k is even and decreasing if k is odd. 

Note that we make a clear distinction between convergents and semiconvergents, i.e., conver¬ 
gents are not a specific subtype of semiconvergents. 

A rational number f is a best approximation of the real number a if for every fraction | such 
that f 7 ^ f and d < bit holds that 

\ba. — a\ < \dcc — c\. 

In other words, any other multiple of a with a coefficient at most b is further away from the 
nearest integer than boc is. The next important proposition shows that the best approximations of 
an irrational number are connected to its convergents (for a proof see Theorems 16 and 17 of [9]). 

Proposition 2.3. The best rational approximations of an irrational number are exactly its convergents. 

We identify the unit interval [0,1) with the unit circle T. Let a € (0,1) be irrational. The map 

R : [0,1) —> [0,1), X i--> {x -|- x}, 

where {x} stands for the fractional part of the number x, defines a rotation on T. The circle 
partitions into the intervals (0, j) and (j, 1). Points in the same interval of the partition are said 
to be on the same side of 0 and points in different intervals are said to be on the opposite sides of 
0. (We are not interested in the location of the point j.) The points and ^re always 

on the opposite sides of 0. The points with 0 < £ < a^ always lie between the points 

{qk- 2 ‘>^} and {qk^}; see (4). 

We measure the shortest distance to 0 on T by setting 
||x|| = min{{x},l — {x}}. 

We have the following facts for k>2 and for all I such that 0 < I < ajf 

Pw)/ 

\\%M\ = ll'?W-l“ll - ll'?A:-l'^ll- 

We can now interpret Proposition 2.3 as 

min ||na|| = fork > 1. ( 5 ) 

0<n<qi 

Note that rotating preserves distances; a fact we will often use without explicit mention. In par¬ 
ticular, the distance between the points {nx} and {mx} is \\\n — m|x||. Thus by (5) the minimum 
distance between the distinct points {mx} and (mx) with 0 < n,m < qj^ is at least For¬ 

mula (5) tells what is the point closest to 0 among the points {nx} for 1 < n < q^ — 1. We are 
also interested in knowing the point closest to 0 on the side opposite to {qj^-icc}. The next result 
is very important and concerns this; see [12, Proposition 2.2.]. 


(3) 

(4) 
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Proposition 2.4. Let a be an irrational number. Let n be an integer such that 0 < n < q^ ( with k > 2 
and 0 < £ < aj^. If \\noi\\ < then n = for some integer m such that 1 < m < 

min{t', aj^ — £ + 1}. 

2.3 Sturmian Words 

Sturmian words are a well-known class of infinite, aperiodic binary words with minimal factor 
complexity. They are defined as the infinite words having n + 1 factors of length n for every n > 0. 
For our purposes it is more convenient to view Sturmian words as the infinite words obtained as 
codings of orbits of points in an irrational circle rotation with two intervals; see [14, 11]. Let us 
make this more precise. The frequency a of letter 1 (called the slope) in a Sturmian words exists, 
and it is irrational. Divide the circle T into two intervals Iq and Ii defined by the points 0 and 
1 — a, and define the coding function v by setting v{x) = Oifx € Iq and v{x) = 1 if X e h. 
The coding of the orbit of a point x is the infinite word Sx,x obtained by setting its n*, n > 0, 
letter to equal v(R'‘(x)) where R is the rotation by angle a. This word is Sturmian with slope a, 
and conversely every Sturmian word with slope a is obtained this way. To make the definition 
proper, we need to define how v behaves in the endpoints 0 and 1 — a. We have two options: 
either take Iq = [0,1 — a) and tj = [1 — a, 1) or Iq = (0,1 — a] and tj = (1 — a, 1]. The difference 
is seen in the codings of the orbits of the special points {—na}, and both options are needed to be 
able to obtain every Sturmian word of slope a as a coding of a rotation. However, in this paper 
we are not concerned about this choice. We make the convention that I{x,y) with x y and 
x,y 7 ^ 0 is either of the half-open intervals of T separated by the points x and y (taken modulo 
1 if necessary) not containing the point 0 as an interior point. The interval J(x, 0) = t(0, x) is 
either of the half-open intervals separated by the points 0 and x having smallest length (the case 
X = 2 is not important in this paper). Since the sequence ({wa})„>o is dense in [0,1)—as is well- 
known—every Sturmian word of slope a has the same language (that is, the set of factors); this 
language is denoted by £(a). Further, all Sturmian words are uniformly recurrent. 

For every factor w = oqOi ■ ■ ■ a„_i of length n there exists a unique subinterval [w] of T such 
that Sx,ct begins with w if and only if x e [ic]. Clearly 

[zv] = la, n R-i(t«J n... n 

We denote the length of the interval [w] by | [ro] |. The points 0, {—a}, {—2a},..., {—na) partition 
the circle into n - 1-1 intervals, which have one-to-one correspondence with the words of £(a) of 
length n. Among these intervals the interval containing the point { —(n -|- l)a} corresponds to 
the right special factor of length n. A factor w is right special if both wO, wl e £(a). Similarly a 
factor is left special if both Ow, Iw € £(a). In a Sturmian word there exists a unique right special 
and a unique left special factor of length n for all u > 0. The language £(a) is mirror-invariant, 
that is, for every w G £(a) also w G £(a). It follows that the right special factor of length n is the 
reversal of the left special factor of length n. Sturmian words are also balanced; that is, the number 
of occurrences of the letter 1 in any two factors of the same length differ at most by 1 . 

Given the continued fraction expansion of an irrational a G (0,1) as in (2), we define the 
corresponding standard sequence (sj-)j->o of words by 

s_i = 1 , so = 0 , si=So'"^s_i, Sfc = k>2. 

As is a prefix of for k > 1, the sequence (sj^) converges to a unique infinite word c^ 
called the infinite standard Sturmian word of slope a, and it equals Scc,a.- Inspired by the notion of 
semiconvergents, we define semistandard words for k > 2 by 

Sk,e = St-lSfc -2 

with 1 < £ < fljt- Clearly |sjt| = qk and |sjt/| = qk,e- Instead of writing "standard or semistan¬ 
dard", we often simply write "(semi)standard". The set of standard words of slope a is denoted 
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by Stand{oi), and the set of standard and semistandard words of slope a is denofed by Stand'^(ix). 
(Semi)sfandard words are leff special as prefixes of fhe word c^. Every (semi)sfandard word is 
primitive [11, Proposition 2.2.3]. An imporfant properly of sfandard words is fhaf fhe words 
and S](-i almosf commufe; namely SytSfc-i = and = wyx for some word w and disfincf 

letters x and y. For more on sfandard words see [11,1]. 

The only difference between fhe words c« and c-^ where a = [0; 1, a 2 , a^, ■ ■.] and a = [0; ^2 + 
1, fl 3 ,...] is fhaf fhe roles of fhe letters 0 and 1 are reversed. We may fhus assume wifhouf loss 
of generalify fhaf fli > 2. For fhe resf of fhis paper we make fhe convention fhat a sfands for an 
irrational number in ( 0 , 1 ) having the continued fraction expansion as in ( 2 ) with > 2 , i.e., we 
assume that 0 < a < j. The numbers and (jj- ^ refer fo fhe denominators of fhe convergenfs of 
a, and fhe words Sj^ and Sjt £ refer fo fhe sfandard or semisfandard words of slope cc. 

2.4 Powers in Sturmian Words 

In fhis section we review some known resulfs on powers in Sfurmian words, and prove helpful 
resulfs for fhe nexf section. 

If a square occurs in a Sfurmian word of slope a, fhen fhe lengfh of fhe word w musf be 
a really specific number, namely a denominafor of a convergenf or a semiconvergenf of a. The 
proof can be found in [3, Theorem 1] or [12, Proposition 4.1]. 

Proposition 2.5. ^ nonempty and primitive, then |zi;| = qo, |m;| = qi or |a;| = qj^^£ 

for some k>2 with 0 < (. < a^. 

Next we need to know when conjugates of (semi)sfandard words occur as squares in a Sfur¬ 
mian word. 

Proposition 2.6. The following holds: 

(i) A factor w E £(a) is conjugate to s^for some k >0 if and only if\w\ = |si-| and w^ € £(a). 

(ii) Let whe a conjugate ofs^^£ with k>2 and 0 < i < a^. Then w^ E jC{a.) if and only if the intervals 
[zf] and [sjt/j have the same length. 

(in) Let n = qo, n = qi, orn = q^^f with k>2 and 0 < £ < %/ s be the (semi)standard word of 

length n. A factor w E jC.{a) of length n is conjugate to s if and only ifw and s have equally many 
occurrences of the letter 0. 

Proof. Claim (i) is a direct consequence of [3, Theorem 3] or alternatively [12, Theorem 4.5]. Claim 
(ii) can be inferred from Theorems 4.3 and 4.5 of [12]. Finally, claim (Hi) is evident from fhe proof 
of [12, Theorem 4.3], buf a short proof can be given: fhe idea is fhaf every factor of lengfh n excepf 
one exceptional factor v is conjugate fo s since s^ occurs in £(a) by (i) and (ii). As nof every facfor 
of lengfh n may have fhe same number of leffers 0 (a righf special facfor always exfends fo two 
factors having different number of letters 0 ), it must be that v has a different number of letters 0 
than any conjugate of s. □ 

We also need to know the index of certain factors of Sturmian words. The following proposi¬ 
tion follows directly from Theorems 3 and 4 of [3] or from [12, Theorem 4.5]. 

Proposition 2.7. The index of the standard word sj- in C{a) is a^_^i + 2 for k >2 and a 2 + Ifor k = 1. 
The index of the semistandard word s^/ in L{(x.) with k >2 and 0 < i < aj^ is 2. 
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3 The Square Root Map 


In [17] Saari observed that every Sturmian word with slope a = [0; fli, a 2 ,...] is an optimal square¬ 
ful word with parameters a = fli — 1 and b = a 2 — 1. The assumption 0 < a. < j implies that 
0^ G £(a), so the six minimal squares in £(a) are the same as in (1). In particular, Saari's result 
means that every Sturmian word can be (uniquely) written as a product of fhe six minimal squares 
of slope cc (1). Thus fhe square roof map infroduced nexf is well-defined. 

Definition 3.1. Let s be a Sturmian word with slope cc and factorize it as a product of minimal 
squares s = ■ ■ ■ . The square root ofs is fhen defined fo be fhe word y/s = X 1 X 2 X 3 ■ ■ ■. 

Lef us consider as an example fhe famous Fibonacci word /. The Fibonacci word is a Sfurmian 
word of slope [0; 2,1,1,...], so if has paramefers a = 1 and b = 0. If is also fhe fixed poinf of fhe 
subsfifufion 0 i-->- 01,1 1 -->- 0. For more informafion, see for insfance [11]. We have fhaf 

/ = ( 010 )^( 100 )^( 10 )^( 01 )^ 0 ^( 10010 )^( 01)2 ■ ■ ■ and 
v 7 = 010 ■ 100 ■ 10 ■ 01 ■ 0 ■ 10010 ■ 01 ■ ■ ■. 


Nofe fhaf a square roof map can be defined for any optimal squareful word. However, now 
we only focus on Sfurmian words; we sfudy lafer fhe square roof map for ofher opfimal squareful 
words in Secfion 8 . 

We aim fo prove fhe surprising facf fhaf given a Sfurmian word s fhe word y/s is also a Sfur¬ 
mian word having fhe same slope as s. Moreover, knowing fhe infercepf of s, we can compufe 
fhe infercepf of y/s. 

In fhe proof we need a special function !/^ : T —)• T defined as follows. For x e (0,1) we sef 
xp{x) = i(x-hl-a), 
and we sef 

ifJo = [0,l-«), 

\l-f, if fo = (0,1-a]. 


The mapping ip moves a poinf x on fhe circle T fowards fhe poinf 1 — a by halving fhe disfance be- 
fween fhe poinfs x and 1 — a. The disfance fo 1 — a is measured in fhe inferval 1q or li depending 
on which of fhese infervals fhe poinf x belongs fo. 

We can now sfafe fhe resulf. 


Theorem 3.2. Let Sx,a he a Sturmian word with slope cc and intercept x. Then y/sffc = ,j. In 

particular, y/Sx,a is a Sturmian word with slope cc. 

For a combinatorial version of fhe above fheorem see Theorem 6.5 in Secfion 6 . 

The main idea of fhe proof is fo demonsfrafe fhaf fhe square roof map is acfually fhe symbolic 
counferparf of fhe function ip. We begin wifh a definition. 

Definition 3.3. A square w^ G £('^) satisfies fhe square root condition if ip{[w^]) C [a;]. 

Nofe fhaf if fhe inferval [ic] in fhe above definition has 1 — a as an endpoinf, fhen w aufomaf- 
ically safisfies fhe square roof condition. This is because tp moves poinfs fowards fhe poinf 1 — a 
buf does nof map fhem over fhis poinf. Acfually, if w safisfies fhe square roof condition, fhen 
necessarily fhe inferval [w] has 1 — a as an endpoinf (see Corollary 4.3). 

We will only skefch fhe proof of fhe following lemma. 

Lemma 3.4. For every i G {1,... ,6} the minimal square root S, of slope cc satisfies the square root 
condition and ^{{x -|- 2|S,|a}) = {^{x) -|- |S, |a}/or all x G [S?]. 
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Figure 1: The positions of the intervals on the circle in the proof sketch of Lemma 3.4. 


Proof Sketch. It is straightforward to verify that 


[51] = 1(0,1-a), 

[5 2 ] =t(- 2 a,l-a), 

[5 3 ] =t(-2a,l-a). 


[5 4 ] =t(l-a,l), 

[5 5 ] = I{1- oc,-q 2 ,iOc), 
[Se] = t(l - oc,-q2,lOi) 


and 


[S?] = 1(0,-2a), [S^] = t(-^ 2 ,i«,l), 

[52] =-f(-('?2,i + l)a/l-a)/ [S5] =t(l-a,-((?3j+l)a), 

[5 3 ] = ^(-20;, -((j 2 ,i + l)a)/ [Se] = ^(-('?3,i + !)«/ -qi.iOf), 


see Figure 1 . Since xp does not map points over the point 1 — a, it is evident that every minimal 
square root satisfies the square root condition. 

Consider then the latter claim. Let i G {1,..., 6 }. Suppose that x e [S 2 ]\{ 0 },{x + 2 lS,la} f 
0, and [x + 2]S,jaJ = 2r for some r > 0. Then 

1 

!/’({x + 2jS,ja}) = -(x + 2jS/ja — 2r + 1 — a) = ip{x) + \Si\oc — r = {ip{x) + jS/ja} (6) 

since ip is a function from T to T. We consider next the cases f = 1 and i = 5; the other cases are 
similar. 

Suppose that S, = Si. Now x + 2a > 2a > 0 = 2po and x + 2a<l — 2a + 2a = l = 2po + 1/ 
so X + 2a G (2po/ 2po + !]■ The claim is thus clear as in ( 6 ) if x 7 ^ 0 and x 7 ^ 1 — 2a. If x = 0, then 
Jo = [0,1 — a) and {xp{x) + a} = { 2(1 — a) + a} = ^(1 + a) = xp{{x + 2a}). If x = 1 — 2a, then 
Jq = ( 0 , 1 — a] and ip{{x + 2 a}) = 1 — f = + «}■ 

Assume then that S, = S 5 . Note that jSsj = q 2 . Using (4) we obtain that 

X + 2q2a < ]] ((^34 + l)a|l + 2q2X 

= 1 — a + IJi^spalj + 2p2 + 2]j(j2all 
= 1 -a+\\qioc\\ - \\q2a\\ + 2p2 + 2ll(j2a|l 
= 1 - a + |l(?ia|j + \\q2‘>‘-\\ + 2p2 
< 2 p 2 + 1 / 


where equality holds only if x = Ifqs^i + l)x]j and 02 = 1. The length of the interval [S|] is 
|li^ 34 ajj. Since 1 — a > a + ll(jia|j and a > |l(?ia|j > | 1 (? 2 ®I 1 / h follows from the preceding inequal¬ 
ities that x + 2 (^ 2 * > 2 p 2 . Therefore X-|- 2 (^ 2 ^ G ( 2 p 2 , 2 p 2 -|-1]. If fl 2 > 1 or x 7 ^ |j (<^ 31 -|-l)a|j, 
then the conclusion follows as in ( 6 ). Suppose finally that 02 = 1 and x = | 1(<?34 -F l)x|l. Now 
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Jo = (0,1 — a], so ^{{x + 2q2x}) = !/’(0) = 1 — f ■ On the other hand, 
1 

ip{x)+q2‘>i = -(1 -a + ||(?3,ia|| + 1 - a) + P2 + Ik2a|| 

1 

= ;r(l - a + ||(jia|| - ||^2a|| + 1 - a + 2||(j2a||) + pi 


— 1 “ 2 

so the conclusion holds also in this case. 


□ 


Proof of Theorem 3.2. Write Sx 


= xlxlxl 


as a product of minimal squares. Since the minimal 


square X^ satisfies fhe square roof condifion by Lemma 3.4, we have fhaf xp{x) E [Xj ]. Hence bofh 
/Sx,a and s^(x)^a begin wifh Xi. Lemma 3.4 implies fhat f{{x + 2|Xi|a}) = {f{x) + |Xi|a} for 


^ip{x) 

all X G [Xj ]. Thus by shiffing Sx,a the amount 21 Xj | and by applying the preceding reasoning, 
we conclude that s.^(^x),a shifted by the amount |Xi| begins with X 2 . Therefore the words 
and s^(^x),i)c agree on their first |Xi | + IX 2 I letters. By repeating this procedure, we conclude that 

-v/ Sx.a. ~ /v- n 


Theorem 3.2 allows us to effortlessly characterize the Sturmian words which are fixed points 
of the square root map. 


Corollary 3.5. The only Sturmian words of slope a which are fixed by the square root map are the two 
words OlCa and lOca, both having intercept 1 — a. 

Proof. The only fixed point of the map \p is the point 1 — a. Having this point as an intercept, we 
obtain two Sturmian words: either OlCa or lOca, depending on which of the intervals Jq and fi 
the point 1 — a belongs to. □ 

The set {OlCa, lOca} is not only the set of fixed points but also the unique attractor of the 
square root map in the set of Sturmian words of slope a. When iterating the square root map on 
a fixed Sturmian word Sx,a, the obtained word has longer and longer prefixes in common with 
either of the words OlCa and lOca because f’^{x} tends to 1 — a as n increases. 


4 One Characterization of Words Satisfying the Square Root Condition 

In the previous section we saw that the minimal squares, which satisfy the square root condition, 
were crucial in proving that the square root of a Sturmian word is again Sturmian with the same 
slope. The minimal squares of slope a are not the only squares in £(«) satisfying the square root 
condition; in this section we will characterize combinatorially such squares. To be able to state 
the characterization, we need to define 

RStand{oc) = {w: w G Stand{a)}, 

the set of reversed standard words of slope a. Similarly we set 

RStand~^{a.) = {w: w E Stand~^{ix.)}. 

We also need the operation L which exchanges the first two letters of a word (we do not apply 
this operation to too short words). 

The main result of this section is the following. 

Theorem 4.1. A square zv^ G £(«) satisfies the square root condition if and only ifw G RStand~^{ix.) U 
L{RStand{a.)). 
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As we remarked in Section 3, a square e trivially satisfies the square root condition 
if ifs inferval [ic] has 1 — a as an endpoinf. Our aim is fo prove fhaf fhe converse is also frue. We 
begin wifh a fechnical lemma. 

Lemma 4.2. Let n = qi or n = for some k >2 with 0 < Z < fljt/ let i he an integer such that 
1 < i < n. 

(i) If {—ioi} e Iq and { —(Z + n)a} < {—ia}, then f{—{i + n)a) > {—ia}. 

(ii) If {—ioi} e !-[ and { — {i + n)(x} > {—ioi}, then ip{—{i + n)a.) < {—ia}. 

Proof. We prove (i), fhe second asserfion is symmefric. Suppose { — ia} e Iq and { —(Z + n)a} < 
{—ia}. Nofe fhaf fhe disfance between fhe poinfs {—ia} and { —(Z + n)a} is less fhana. If follows 
fhaf {—na} e f. Assume on fhe confrary fhaf f{ — {i + n)a) < {—ia}, fhaf is, 

{-(Z + n)a} + ^({1 - a} - {-(Z + n)a}) < {-ia}. 

Since 0 < { —(Z + n)a} < {—ia}, fhe disfance between { —(Z + n)a} and {—ia} is the same as the 
distance between 1 and {—na}. Thus by substituting { —(Z + n)a} = {—ia} — (1 — {—na}) to the 
above and rearranging, we have that 

{1 — a} — {—Za} < 1 — {—na}. 

Since {—na} e f, we obtain that 

II — (Z — l)a|| < II — na||. (7) 

Suppose now first that n = q^ i for some k > 2 and 0 < Z < flj-. Since Z — 1 < n. Proposition 2.4 
and (7) imply that Z — 1 = mq^_i for some 1 <m < min{Z, a^ — l + 1}. As {—na} e Jj, the point 
{—qk-iOi.} must lie on the opposite side of 0 in the interval Iq. Therefore { —(Z — !)«} 6 Iq. Then 
by (7), the point {—Za} must lie in tj. This is a contradiction. Suppose then that n = q^. It is easy 
to see that (7) carmot hold for any Z greater than 1. This concludes the proof. □ 

Corollary 4.3. Ifw^& >C(a) with w primitive satisfies the square root condition, then the interval [ic] 
has 1 — aas an endpoint. 

Proof. Let n = |a;|. Proposition 2.5 implies that n = qQ, n = qi, or n = qi^q for some k > 2 with 
0 < Z < fljt- Say n = qo = 1. As the only factor of length 1 occurring as a square is 0, the claim 
holds as [0] = Iq = ^(0/1 “ Suppose then that n = qi or n = qj^i. 

Let [lu] = I{—ia,—ia). Then either [w^] = t(—Za, —(; + li^Da) or [w^] = J( —(Z+ |a;|)a, —;a). 
Suppose first that [ic] C Iq. By symmetry we may assume that {-ja} > {— Za}. Now [w!^] = 
[—(Z + |ic|)a, —ja) if and only if j = 1. Namely, if j f 1, then it is clear that it is possible to find 
a point X e I(—ia,-ja) close to {-ja} such that f{x) > {-ja}, so the condition f{[w^]) C [w] 
carmot be satisfied. If [w^] = [—ia,—{j + |ic|)a) and; 1, then by Lemma 4.2 !p( —(;+ > 

{-ja}, so the condition f{[w^]) C [a;] carmot be satisfied. Thus also in this case necessarily 
; = 1. The case where [ic] C f is proven symmetrically using the latter symmetric assertion of 
Lemma 4.2. □ 

Next we study in more detail the properties of squares w^ G >C(a) whose interval has 1 — a as 
an endpoint. 

Proposition 4.4. Consider the intervals of factors in L{a) of length n = qi or n = q^i with k >2 and 
0 < Z < fl/t- Z-st M V be the two distinct words of length n having intervals with endpoint 1 — a. Then 
the following holds. 

(i) There exists a word w such that u = xyw and v = yxw = L{u) for distinct letters x and y. 
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(ii) Either uorv is right special. 

(Hi) If }i is the right special ivord among the words u and v, then jp- e £(«). 

(iv) If A is the word among the words u and v which is not right special, then £ >C(a) if and only if 

n = qi or I = a^. 

Proof. Suppose first that n = q\. Then it is straightforward to see that the factors u and v of 
length n having intervals with endpoint 1 — a are = S 2 and = S 4 . Clearly S 4 is 

right special and L(S 4 ) = S 2 . Moreover S\, £ P,{oc). 

Assume that n = q^i for some k > 2 with 0 < / < fljf By Proposition 2.4 the point {—naj is 
the point closest to 0 on the side opposite to the point {—qk-i^}- Thus either { — {n + l)a} £ [u] 
or { —(n + l)a} £ [i;]. Assume by symmetry that { —(n + l)a} £ [u]. This means that the word u 
is right special, proving (ii). Further, the endpoint of [u] which is not 1 — a must be after a rotation 
the next closest point to 0 on the side opposite to the point {—qk-i<’^}- Thus by Proposition 2.4 
[u] = I{—{qk^i-i + l)a,l — a) and consequently [u] = t(l — a, —{qk-i + l)a). 

Since the points x = {{ — {qk^i-i + l)a} and y = { — {qk-i + l)a} are on the opposite sides 
of the point 1 — a and the points {x + ol] and {y + ol] are on the opposite sides of the point 0, 
it follows that u begins with cd and v begins with dc for distinct letters c and d. Assume on the 
contrary that u = cdzeu' and v = dczfv' for distinct letters e and /. In particular, |z| < n — 3. This 
means that the point x' = {x + (|z| + 2)a} is in [e] and the point y' = {y + (|z| + 2)a} is in [/]. It 
must be that e = c and f = d as otherwise the point x' — oc would be in [c] and the point y' — x 
would be in [d] contradicting the choice of z. Since a is irrational, either x' is closer to 1 — a than 
X or y' is closer to 1 — a than y. 

Suppose that x' is closer to 1 — a than x. Since x' is on the same side of the point 1 — a as x, it 
follows that 

||x' + a|| = \\{qk,i-i - |z| -2)a|| < \\qk,i-ix\\ = ||x: + a||. 

Since qk,i-i “ |z| — 2 < qk,i-i, by Proposition 2.4 it must be that qk,i-i — |z| — 2 < 0 . However, 
IIII = II “ ‘?)c,/-i“l|/ it follows by Proposition 2.4 that |z| + 2 — qk,i-i = ^cjk-i for some 
m > 1 . Thus |z| + 2 > qk,i-i + qk-i = dk,l = This is, however, a contradiction as |z| < n — 3 . 
Suppose then that y' is closer to 1 — a than y. Similar to above, it follows that 

||y' + a|| = \\{qk-i- |z| - 2)a|| < \\qk-ix\\ = ||y + a||. 

Again, it must be that qk-i ~ |z| — 2 <0. Since ||i^i-_ia|| = || — qk-ix\\, it follows from (5) that 
|z| + 2 — qk-\ > qk- Therefore |z| + 2 > yj- + %-i > This is again a contradiction with the fact 
that |z| < n — 3. 

Thus we conclude that u = cdw and v = dew for some word w proving (i). As n = qkp 
it must be that the right special word of length n equals Sjt,;- Since u and v are conjugate by 
Proposition 2.6 (iii). Proposition 2.6 implies that if I = ak, then iP, ip £ L[x). Suppose that I f ak. 
By Proposition 2.6, the word Ski occurs as a square in £(a). Since £(a) is mirror-invariant, also 
= s^i £ £(a). Therefore from Proposition 2.6 it follows that |[m]| = ||y;t,/-i®ll = |[s£/]|- 
Now [u] = J(1 — a, —(y/t-i + !)«)/ so |[u]| = ||yi;_ia|| 7 ^ |[m]|. Thus Proposition 2.6 implies that 
£(a). Hence (iii) and (iv) are proved. □ 

Proof of Theorem 4.1. If |i(;| = 1, then clearly w = 0 = sq/ so the claim holds. We may thus focus 
on the case that |w| > 1 . 

Suppose that w^ £ £(a) satisfies the square root condition. By Corollary 4.3 the interval [ic] 
has 1 — a as an endpoint. Moreover, Proposition 2.5 implies that |a;| = qi or |w| = qki for some 
k > 2 with 0 < I < ak. Thus from Proposition 4.4 it follows that w = Jor w = L(s ) where s 
is the (semi)standard word of length |ii;|. By Proposition 4.4 we have that £ C{x). Moreover, 
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by Proposition4.4 we have that L(s)^ G if only if |h;| = q^- for some k > 1. Thus 

zv G RStand~^{a) U L{RStand{a)). 

Suppose fhen fhaf h; G RStand\oc)UL{RStand[a.)). Nofe firsffhaf L(zi;) has fhe same number 
of leffers 0 as w, so w is conjugafe fo L{w) by Proposifion 2.6. Thus if follows from Proposition 2.6 
fhaf G £(«)■ Lef u and v be fhe factors of lengfh |rf | having endpoinf 1 — a. By Proposifion 4.4 
fhe word u musf be righf special and v = L{u). Since fhe righf special factor of lengfh |ic| is 
unique, eifher a; = m or L{zv) = u. Thus fhe inferval [w\ has 1 — a as an endpoinf. Then clearly 
w'^ satisfies fhe square roof condifion. □ 

5 Characterization by a Word Equation 

If fums ouf fhaf fhe squares of slope a satisfying fhe square roof condifion have also a differenf 
characferizafion in terms of specific solutions of fhe word equation 

XlXl---Xl = {X^X2---Xnf ( 8 ) 

in fhe language £(a). We are interested only in fhe solufions of ( 8 ) where all words X, are minimal 
square roots (1), i.e., primitive roofs of minimal squares. Thus we give fhe following definition. 

Definition 5.1. A nonempty word w is a solution to ( 8 ) if zv can be written as a producf of minimal 
square roofs zv = XiX 2 - ■ ■ X„ which safisfy fhe word equation ( 8 ). The solution is trivial if Xj = 
X 2 = ... = X„ and primitive if zv is primifive. The word a; is a solution to ( 8 ) in L{oi) if zv is a 
solution fo ( 8 ) and zv^ G £(«)• 

All minimal square roofs of slope a are frivial solufions fo ( 8 ). One example of a nonfrivial 
solufion is w = S 2 S 1 S 4 in fhe language of fhe Fibonacci word (i.e., in fhe language of slope 
[0;2,1,1,...]) since zv^ = (01010)^ = (01)^ ■ 0^ ■ (10)^ = S^SjS^. Note fhaf in fhe language of any 
Sfurmian word fhere are only finifely many frivial solufions as fhe index of every facfor is finife. 

Nofe fhaf fhe facforizafion of a word as producf of minimal squares is unique. Indeed, if 
Xj ■ ■ ■ X^ = ■ ■ ■ Ym, where fhe squares X? and Yf are minimal, fhen eifher X^ is a prefix of 

Yj or vice versa. Therefore by minimalify X^ = Y^, fhaf is, Xj = Y^. The uniqueness of fhe 
facforizafion follows. 

Our aim is fo complefe fhe characferizafion of Theorem 4.1 as follows. 

Theorem 5.2. Let zv G £(a). The following are equivalent: 

(i) w is a primitive solution to (8) in £(a), 

(ii) satisfies the square root condition, 

(Hi) zv G RStand^{ix) U L{RStand{ix)). 

For lafer use in Secfion 8 we define fhe language £(a, b). 

Definition 5.3. The language £(a, b) consists of all facfors of fhe infinife words in fhe language 

( 10 a+l(ioa)b ^ ;^oa+l(;^oO)b+l)a; ^ ^ 5 ^) 0 ,^ 

Observe fhaf by Proposifion 2.2 every facfor in £(a, b) is a facfor of some optimal squareful 
word wifh parameters a and b. Moreover, if a = [0; a + 1, b + 1,...], fhen £(a) C £(a, b). 

Definition 5.4. The language TI(a, b) consists of all nonempfy words in £(a, b) which can be 
written as producfs of fhe minimal squares ( 1 ). 
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Let w 6 n(a, b), that is, w = for minimal square roots X,. Then we can define fhe 

square roof of w by seffing ^/zv = Xi ■ ■ ■ X„. 

We need fwo fechnical lemmas. Their proofs are sfraighfforward case-by-case analysis. The 
sfafemenf of Lemma 5.5 has a fechnical condifion for lafer use in Secfion 8 , which is perhaps bef- 
fer undersfood if fhe reader firsf reads fhe proof of Lemma 5.6 up fo fhe poinf where Lemma 5.5 
is invoked. 

Lemma 5.5. Let u and v be words such that 

• u is a nonempty suffix of S(„ 

• > iSsSel, 

• V begins with xy for distinct letters x and y, 

• uv E £(a, b) and L{v) E £(a, b). 

Suppose there exists a minimal square X^ such that |X^| > |m| and X^ is a prefix ofuv or uL(v). Then 
there exist minimal squares ..., such that X^ and Yj ■ ■ ■ are prefixes of uv and uLiv) of the 
same length and X = Yj ■ ■ ■ Y„. 

Proof. Lef be a minimal square such fhaf |Z^| > |u| and Z^ is a prefix of uv or uL{v). If is 
nof obvious af fhis poinf fhaf Z exisfs buf ifs exisfence becomes evidenf as fhis proof progresses. 
By symmefry we may assume fhaf Z^ is a prefix of uv. To prove fhe claim we consider differenf 
cases depending on fhe word Z. 

Case A. Z = Si = 0. Since m is a nonempfy suffix of S 5 and |Z^| > |u|, it must be that 
u = 0. As V begins with 0, we have that v begins with 01 by assumption. Since v E C{a, b) and 
> iSel, the word v begins with either 010“10“ or 010“+^10“. In the latter case L{v) would 
begin with contradicting the assumption L{v) E £(a, b). Hence v begins with 010“10“. 

It follows fhaf uv has 0010“10“ as a prefix, fhaf is, uv begins wifh SjS|. On fhe ofher hand, fhe 
word mL(i?) has fhe word S 3 = 010“+^10'^ as a prefix. Since S 3 = S 1 S 4 , fhe conclusion of fhe claim 
holds. 

Case B. Z = S 2 = 010““^. If u = 0, fhen v has 10“10'^ as a prefix and, consequenfly, L{v) has 
lOa-ifon ^ prefix confradicfing fhe facf fhaf L{v) E £(a, b). Therefore by fhe assumpfions fhaf 
u is a nonempfy suffix of Se and |Z^| > |m|, if follows fhaf u = 010'^. Thus v has 10'^ as a prefix. 
Using fhe facf fhaf L{v) E £(a, b), we see fhaf v begins wifh and L{v) begins wifh 010“. 
Hence uv has S^Sj as a prefix, and uL(v) has S 3 as a prefix. Since S 2 S 1 = S 3 , we conclude, as in 
fhe previous case, fhaf fhe conclusion holds. 

Case C. Z = S 3 = 010“. Using again fhe facf fhaf m is a suffix of Sg and |Z^| > \u\, we see fhaf 
eifher m = 0 or u = 010“. In fhe firsf case v begins wifh 10“'''^10“ and L{v) begins wifh 010“10“. 
Hence fhe word uL{v) has S^S^ as a prefix. As S 1 S 4 = S 3 , fhe conclusion follows. Lef us fhen 
consider fhe ofher case. Now L{v) begins wifh 10“''‘^, so fhe word uL{v) has S^Sj as a prefix. 
Again, fhe conclusion follows since S 2 S 1 = S 3 . 

Case D. Z = S 4 = 10“. Now fhe only option is fhaf u = 10“. Using fhe facf fhaf v E £(o, b), 
we see fhaf v cannof begin wifh 10“1, so v musf have 10“'^^ as a prefix. Furfher, since bl > ISel, 
if musf be fhaf Se is a prefix of v. If Sel would be a prefix of v, fhen fhe word L{v) would have 
fhe word (10“)^’*'^1 as a factor confradicfing fhe facf fhaf L[v) E £(a, b). Thus S 5 O is a prefix 
of V. Since v E £(a, b) and |zj| > iSsSgl, we have fhaf S60(10“)^+^ = S 5 l 0 “ is a prefix of v. 
Consequenfly, fhe word L{v) begins wifh 0(10“)^“'"^10“'^^(10“)^“*"^, so uL{v) has Sg as a prefix. 
Assume firsf fhaf b is odd. If is sfraighfforward fo see fhaf in fhis case 

0 (i 0 “)bl 0 “+l(i 0 “)b+l = ^g2^{b+l)/2g2^g2^(b+l)/2^ 

Thus for fhe prefix 10 “S 5 l 0 “ of uv we have fhaf 

10“S§10“ = S|(S|)(‘’+i)/ 2 s 2 ( 52 ^(b+i)/ 2 ^ 
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As Se = the conclusion follows as before. Assume fhen fhaf b is even. If is 

now easy fo show fhaf 

0 (i 0 O)bi 0 t>+i(i 0 “)‘’+i = 

Therefore 

10“S§10“ = S|(Sf)‘’/2s2(s2)b/2^ 

Since Se = S 4 S 2 '^^S 3 S 4 ^^, fhe conclusion again follows. 

Case E. Z = Ss = 10“+^(10“)^ Now eifher w = 10“ or m = 10“+^(10“)‘'+^ In fhe 
firsf case v musf begin wifh 0(10“)*’10“+^(10“)^. However, fhis implies fhaf L{v) begins wifh 
20ti+i(io“)*’“^10“+^(10“)*’ confradicfing fhe facf fhaf L{v) € £(a, b). Consider fhen fhe laffer 
case where v begins wifh 0(10“)*^. As L{v) e £(o, b) and |i;| > iSg], if musf be fhaf L{v) begins 
wifh Hence fhe word uL(v) has Sg as a prefix. Since fhe word v begins wifh 

0 (ioa)b+ 2 ^ fhe word uv has S|S| as a prefix. The conclusion follows as S 5 S 4 = S 5 . 

Case F. Z = Se = Now fhere are fwo possibilifies: eifher u = 10“ or 

u = 10“'^^(10“)^’*’^. In fhe firsf case v begins wifh 0(10“)^''’^10“'*'^(10“)^'*'^, so L{v) begins wifh 
10“+i(i0“)i’l0“+^(10“)^+^. The word uL{v) has S|0(10“)^10“''’^(10“)^’*'^ as aprefix. Proceeding 
as in fhe Case D depending on fhe parify of b, we see fhaf fhe conclusion holds. Consider fhen 
fhe laffer case u = 10“+^(10“)^''‘^. The word v musf begin wifh u, so L{v) has 0(10“)^+^ as a 
prefix. Clearly fhe word uL{v) has S|S| as a prefix. As Se = S 5 S 4 , fhe conclusion follows. □ 

A more infuifive way of sfafing Lemma 5.5 is fhaf under fhe assumptions of fhe lemma swap¬ 
ping fwo adjacenf and disfincf letters which do nof occur as a prefix of a minimal square affecfs 
a producf of minimal square only locally and does nof change ifs square roof. 

Lemma 5.6. Let w be a primitive solution to ( 8 ) having the word 85 = 10“'^^(10“)‘’''‘^ as a suffix such 
that w^, L{w) e £(a, b). Then wL(w) e n(o, b) and a/ wL{w) = w. 

Proof. If re = Sg, fhen if is easy fo see fhaf wL{w) = S 5 S 4 and w = S 5 S 4 , so fhe claim holds. We 
may fhus suppose fhaf Sg is a proper suffix of w. 

Since w is a solution fo ( 8 ), we have fhaf w^ = and ic = X 4 ■ ■ ■ X„ for some minimal 

square roofs X,. If musf be fhaf n > 1 as if n = 1 fhen iv = Xi, and if is nof possible for Sg fo 
be a proper suffix of w. Assume for a confradiefion fhaf Xj = Si. Since X 1 X 2 is a prefix of w^, if 
follows fhaf X 2 begins wifh fhe leffer 0. If X 2 7 ^ Si, fhen X 1 X 2 begins wifh 001 buf X^X^ begins 
wifh 000, which is impossible. Hence X 2 = Si, and by repeating fhe argumenf if follows fhaf 
Xj- = Si for all k such fhaf \ < k < n. Thus zv cannof have Sg as a suffix, so we conclude fhaf 
Xi Si. Hence w always begins wifh 01 or 10. 

We show fhaf jX^I < |a;|. Assume on fhe confrary fhaf \Xf\ > |ic|. Since w has fhe word 
Sg as a suffix, if follows fhaf Sg is a factor of X^. If follows fhaf Xi is one of fhe words S 5 , Sg 
or S 3 (if b = 0). If Xi = S 5 , fhen Sg occurs in Xj = 10'’+^(10“)^10“+^(10'’)^ only as a prefix. 
Thus w = Sg confradicfing fhe facf fhaf Sg is a proper suffix of w. If Xi = Sg, fhen Sg occurs 
in X 4 = 10“+^(10“)^“'"^10“+^(10“)^“'"^ as a prefix and as a suffix. Since w 7 ^ Sg, if musf be fhaf 
w = X^ confradicfing fhe primifivify of w. Lef finally b = 0 and Xi = S 3 . Then Sg occurs in 
X 4 = 010““*’^10“ as a suffix. Hence w = X^ confradicfing again fhe primifivify of w. 

Now fhere exisfs a maximal r such fhaf 1 < r < n and X^ ■ ■ ■ X^ is a prefix of w. Acfually 
X 4 ■ ■ ■ X^ is a proper prefix of w, as ofherwise w^ = (X^ ■ ■ ■ X^)^ = (Xi ■ ■ ■ X^Xi ■ ■ ■ X^)^, so 
w = (Xi ■ ■ ■ Xr)^ confradicfing fhe primifivify of w. Thus when factorizing wL(w) and w^ as 
produefs of minimal squares, fhe firsf r squares are equal. Lef u be fhe nonempfy word such fhaf 
zv = X 4 ■ ■ ■ X^u. By fhe definifion of fhe number r, we have fhaf m is a proper prefix of X^^j. 
Suppose for a confradiefion fhaf \u\ > |Sg|. If follows fhaf u has Sg as a proper suffix. This 
leaves only fhe possibilifies fhaf X,.+i is eifher of fhe words S5 or Sg. However, if X,.+i = S5, 
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then Se cannot be a proper suffix of u, and if = Se, fhen r is nof maximal. We conclude fhaf 

\u\ < iSel- 

Nexf we show fhaf w musf safisfy |if| > iSsSgj. Suppose firsf fhaf iv begins wifh fhe leffer 
0. Then as is a proper suffix of zu and e jC.{a, b), if musf be fhaf zv begins wifh 0(10“)^“'"^. 
Suppose fhaf fhis prefix overlaps wifh fhe suffix S 5 . Then clearly zv = 0(10“)(10“)®’+^ = 
(0(10“)*’+^)^ confradicfing fhe primifivify of zv. If fhe prefix 0(10“)*’+^ does nof overlap wifh fhe 
suffix Se, fhen | > jSsSel- Assume fhen fhat zv begins with the letter 1. Similar to above, the 
word a; must begin with In this case necessarily |a;| > jSsSel. 

Finally, we can apply Lemma 5.5 to the words u and zv with X = We obtain minimal 

squares Yf ,..., such that Y^ ■ ■ ■ is a prefix of uL{zv) and and Y\- ■ - Ym = X,.+i ■ ■ ■ X^+t for 
some f > 1. Thus 

zvL{zv) =X\--- X]yI ■ ■ ■ Y^X^^f+i ■■■Xl and 

w = Xi ■ ■ ■ X„ = Xi ■ ■ ■ X,Yi ■ ■ ■ Y^X,+f+i ■ ■ ■ X„. 

The claim is proved. □ 

Proposition 5.7. Let zv e RStand~^(ix) U L{RStand{a)). Then the zvord zv is a primitive solution to (8) 
in C{ol). 

Proof. Note that Proposition2.6 implies that zv^ € £{oc). Suppose first that |h;| < jSel where 
Se = ? 3 p = 10“"''^ (10“)^+^. Clearly the minimal square root Si, ...,85 are solutions to ( 8 ), so 
we are left with the case where zv = §21 = 0(10“)^ for some i such fhat 1 < i < b + 1. It is 
straightforward fo see fhaf if £ is even, then 

zv^ = (S 2 )^^^Sf( 84 )^^^ and zv = S^^^SiS^^^. 

If £ is odd, fhen 

^2 ^ (s 2 ^(t+l)/ 2 s 2 (s 2 ^(^+l )/2 ^ ^ 5 h+l)/ 2 s^s(Nl)/ 2 _ 

Hence re is a solution fo ( 8 ). 

We may fhus suppose fhaf |if | > [Sel, so w has Se as a suffix. We proceed by induction. Now 
eifher zv = s^^^ for some fc > 3 wifh 0 < £ < or L{zv) = 'sj^ for some k >3. We assume fhaf fhe 
claim holds for every word safisfying fhe h 5 rpofheses which are shorfer fhan zv. Consider firsf fhe 
case zv = for some k > 3 wifh 0 < £ < ai^. By fhe facf fhaf S)t-iSfc -2 = we obfain 

thaf 


ZV^ = Sfc_2s/_iSt_2s/_i = Sk_2s[_lL{Sk-2)s[_l ' sl_^ = Sfc,^_iL(s;t,^-l) ' 

Now if fc = 3 and £ = 1, fhen fhe conclusion holds as S 31 = S 5 is a minimal square roof. Hence 
we may assume fhaf eifher k > 3 or k = 3 and £ > 1. Since ?i^_i is a solution fo (8), we have fhaf 
s ^_4 = X 4 ■ ■ ■ X^ and ?jt_i = Xi ■ ■ ■ X„ for some minimal square roofs X,. In ofher words, 

sf_i e n(o, b) and = ?,t_i. 

Since 1% £_i | > |Se |, with an application of Lemma 5.6 we obfain fhaf 
Sk,e-lL(Sf:/-l) E n(a,b) and ys;t,^-li(sj:/-l) =S;t/_l. 

Thus zv^ E n(o, b) and 
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so h; is a solution to ( 8 ). 

Consider next the case w = L(s^) for some k > 3. Similar to above, 

IV^ = L(Sjt_2)Sfc"^iL(S/c-2)4“-l = i'(Sfc-2)4-l^S-t-2Sfc-l^ 

= ^Sk-2)h-lSk-3Sk-2%'Si\-2Sk'L~i^ = L(Sk-2)Sk-lSk-3h-2'~^ 

= ^k-\^k-2^k-3^k-2 ' ~ ^k-l^{Sk-l) ' ^k,ai^-l- 

li k > 3, then the claim follows using the induction hypothesis and Lemma 5.6 as above. In the 
case k = 3 we have that 

Sjt_iL(s;t_i) e n(a,b) and = s^.i. 

Namely, it is not difficult to see that if b is even, then 
Sk-iL{s^-i) = and Sfc_i = 

If b is odd, then 

= (Si)(‘’+l)/2s2(s2)(b-l)/2 ~ ^ g(b+l)/2g^g(b-l)/2_ 

Thus w is a solution to ( 8 ) also in the case k = 3. □ 

Note that a word w in the set L{RStand'^{oi)) \ L{RStand{a.)) is a solution to ( 8 ) but not in the 
language £{oc). Rather, w is a solution to ( 8 ) in C{^) where /5 is a suitable irrational such that 
L{w) is a reversed standard word of slope /5. 

From Proposition 5.7 we conclude the following interesting fact: 

Corollary 5.8. There exist arbitrarily long primitive solutions of (8) in L[oi). 

We can now prove Theorem 5.2. 

Proof of Theorem 5.2. By Proposition 5.7 and Theorem 4.1 it is sufficient to prove that (i) implies 
(ii). 

Suppose that a; is a solution to (8) in L{ol). Write as a product of minimal squares: = 

■ ■ ■ X^. Let X G [w^]- Then the word Sj^ci begins with X^X^ ■ ■ ■ X^, so by Theorem 3.2 the 
word y/Sx,ct = begins with X 1 X 2 ■ ■ ■ X„. Therefore xp{x) e [X 1 X 2 ■ ■ ■ X„] = [ic]. Thus zv^ 

satisfies the square root condition. □ 

6 A More Detailed Combinatorial Description of the Square Root Map 

Recall from Section 3 that the square root ^ of a Sturmian word s has the same factors as s. The 
proofs were d 5 mamical; we used the special mapping xp on the circle. In this section we describe 
combinatorially why the language is preserved; we give a location for any prefix of -y/s in s. As a 
side product, we are able to describe when a Sturmian word is uniquely factorizable as a product 
of squares of reversed (semi)standard words. 

Let us begin with an introductory example. Recall from Section 3 the square root of the Fi¬ 
bonacci word /: 

/ = (010)^(100)2(10)2(01)202(10010)2(01)2 ■ ■ ■ , 
v7 = 010 ■ 100 ■ 10 ■ 01 ■ 0 ■ 10010 ■ 01 ■ ■ ■. 

Obviously the square root Xi = 010 of (010)^ occurs as a prefix of /. Equally clearly the word 
010 ■ 100 = \/(M0p(T00)^ occurs, not as a prefix, but after the prefix Xj of /. Thus the position of 
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the first occurrence of 010 ■ 100 shiffed |Xi | =3 posifions from fhe position of fhe firsf occurrence 
of Xi. However, when comparing the position of fhe firsf occurrence of a/( 010)^(100)^(10)^ wifh 
the first occurrence of 010 ■ 100, we see fhaf fhere is no furfher shiff. By further inspection, the 
word a/( 010)^(100)^(10)^(01)^02(10010)^ occurs for fhe firsf fime af posifion |Xi| of /. This is 
no longer frue for fhe firsf seven minimal squares; fhe firsf occurrence of X 1 X 2 = 010 ■ 100 ■ 10 ■ 
01 ■ 0 ■ 10010 ■ 01 is af posifion IX 1 X 2 I = 16 of /. The amounf of shiff from fhe previous posifion 
|Xi| = 3 is 1 X 2 1 = 13; observe fhaf bofh of fhese numbers are Fibonacci numbers. Thus fhe 
amounf of shiff was exacfly fhe lengfh of fhe square roofs added affer observing fhe previous 
shift. As an observant reader might have noticed, both of fhe words Xj and X 2 are reversed 
sfandard words, or equivalenfly, primifive solutions fo (8). Repeating similar inspections on 
ofher Sfurmian words suggesfs fhaf fhere is a cerfain pattern fo fhese shifts and fhaf knowing fhe 
paffern would make if possible fo locafe prefixes of ^/s in fhe Sfurmian word s. Thus if makes 
very much sense fo "accelerafe" fhe square roof map by considering squares of solufions fo (8) 
rnsfead of jusf minimal squares. Nexf we make fhese somewhat vague observations more precise. 

Every Sturmian word has a solution of (8) as a square prefix. Nexf we aim fo characferize 
Sfurmian words having infinitely many solutions of (8) as square prefixes. The nexf fwo lemmas 
are key resulfs fowards such a characferizafion. 

Lemma 6.1. Consider the reversed (semi)standard word s^ i of slope oc with k > 2 and 0 < i < aj^. The 
set \skj] \ {1 — a} equals the disjoint union 

( CO «A:+2i \ 1-1 

U U [4+21,/] \ 

,=0 ;=1 J 1=1 

Analogous representations exist for the sets [?o] \ {1 “ [n] \ {1 “ ®}- 

To puf if more simply: for each x 7 ^ 1 — a fhere exisfs a unique reversed (semi)sfandard word 
w such fhaf x £ [ro. To illusfrafe fhe proof, we begin by giving a proof skefch. 

Proof Sketch. Consider as an example fhe inferval [0] = f(0,1 — a). If is easy fo see fhaf [0^] = 
1(0, —2a) = 1(0,—(qo + l)a), so [0] = [0^] U I( — (qo + l)a,l — a). The inferval f(—+ 1 )n 1 — 
a) is fhe inferval of fhe factor S 2 ,i- Therefore [0] = [sg] U [? 2 ,i]- Since £ C(a), fhe inferval [s| 
splifs info fwo parfs: [s 2 ,i] = [^ 21 ]*^/- ft is sfraighfforward fo show fhaf / = I( — (q 2 ,i + 1-)a,l—a). 
Again, fhe inferval / is fhe inferval of fhe facfor zv which equals eifher $ 2,2 or S 41 depending on 
fhe number 02 - So [0] = [sq] U [? 21 ] U [a;]. This process can be repeafed for fhe inferval [ro] and 
indefinifely affer fhaf. The very same idea can be applied fo any inferval g ]. □ 

Proof of Lemma 6.1. Consider fhe lengfhs of fhe reversed (semi)sfandard words begirming wifh 
fhe same leffer as s^^g. Ouf of fhese lengfhs we can form fhe unique increasing sequence (hn) 
such thaf b\ = qk,g-i- If we sef si = s-f^^g and ft = I(—(bi + l)a,l — a), fhen based on fhe 
observations in the proof of Proposition 4.4 we see fhaf ft = [si]. The inferval ft is splif by fhe 
point { — + l)a} = { — (b 2 + l)a}. It must be that [sj] = I( — (bi +I)a,—(b 2 + l)a). Otherwise 

[s^] = [si] n R“^ 2 ([si]) = J( —(172 + l)to 1 — a), so the points { —+ l 72 )a} and {—bia} are on 
the opposite sides of 0. Furfhermore, || (Fj + 172 )^'^|| equals fhe disfance befween fhe poinfs { — bia} 
and { — b 2 a}, so || (bi + b 2 )oi\\ = ||(ji:_ia||. Since also fhe poinf {—qk-ii^} is on fhe side opposife fo 
{-bga}, if follows fhaf = bi+b 2 which is obviously false. Thus ft = ft \ [sf] = I(—(b 2 + 
l)a, 1 — a) is fhe inferval of S 2 , fhe unique reversed (semi)sfandard word of lengfh be, begirming 
wifh the same letter as Si. By repeating this when n > 1, we see that the interval ]„ is split by 
the point { — (b„_^.l + l)a} and that [s^] = I(—(b„ + l)a, —(b„^i + l)a). Then there is a unique 
reversed (semi)standard word s„+i such that [s„+i] = I(—(b„+i + l)a, 1 — a) = /„ \ [s^]; we set 
ft+i = [s„+i]. By the definition of fhe sequence (b„), fhe words s„+i and si begin wifh fhe same 
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letter. This yields a well-defined sequence (Jn) of nesfed subintervals of }\. If is clear thaf \fn\ —t 0 
as n —)• 00 . It follows fhat 

OO 

[sk,e] U {1 - a} = /i U {1 - a} = (J [si] U {1 - a.}. 

n—1 

The sefs [s^] are by definifion disjoint. The claim follows since fhe indexing in fhe claim is jusf 
anofher way fo express fhe reversed (semi)sfandard words having lengfhs from fhe sequence 

(bn)- 

The above proof works as if is for fhe cases Sg ^od ?i; only minor adjustmenfs in nofation are 
needed. □ 

Lemma 6.2. Let u e RStand'^{oc) and v e RStand^{oi) U L{RStand^{a.)). Then is never a proper 
prefix ofv^. 

Proof. If u e RStand^{a) and \u\ \v\, fhen by Lemma 6.1, fhe infervals [u^] and [v^] are disjoinf. 

Hence ip- can never be a proper prefix of v'^. Assume fhen fhaf v € L{RStand'^{a)). If bl < |si|, 
fhen is a minimal square, so if is nof possible for iP fo be a proper prefix of v^. Suppose 
fhaf 1^1 = for some k > 2 wifh 0 < (. < a^. As in fhe proof of Proposition 4.4, we have 
fhaf [u] = — + l)a, 1 — a). If u begins wifh fhe same leffer as v and |m| < \v\, fhen 

|m| ^ Isjt-il- It follows, as in fhe proof of Lemma 6.1, fhaf fhe disfance befween 1 — a and eifher 
of fhe endpoinfs of fhe inferval [u^] musf be at least Hence the intervals [u] and [u^] are 

disjoint, so is not a proper prefix of v^. □ 

Lef s be a fixed Sfurmian word of slope a. Since fhe index of a factor of a Sfurmian word is 
finite. Lemma 6.2 and Theorem 5.2 imply fhaf if s has infinifely many solufions of (8) as square 
prefixes fhen no word in RStand'^{a.) is a square prefix of s. We have now fhe proper tools to 
prove the following: 

Proposition 6.3. Let Sx,a be a Sturmian word of slope oc and intercept x. Then Sx,a begins with a square 
of a word in RStand^{a) if and only ifx — x. 

Proof. If X 7 ^ 1 — a, fhen x e Jg \ {1 — a} = [sg] \ {1 — a} or x e Ji \ {1 — a} = [?i] \ {1 — a}. 
Thus by applying Lemma 6.1foJg\{l — A;}orfi\{l — a}, we see fhaf fhe word Sx,a begins wifh 
a square of a word in RStand^{oc). 

Suppose fhen fhaf x = 1 — a. Then Sx,cc G {OlCa, lOca}. If is a well-known facf fhaf S 2 /c = 
P2k^0 and S 2 /C +1 = QiSi+iOl for some palindromes P 2 k and Q 2 k+i for every A: > 1 (see e.g. [11, 
Lemma 2.2.8]). As Cx = limj-_j.oo Sjt, if follows fhaf OlCa = limjt_j.oo § 2 *; and lOca = limj-_^oo S 2 jc+i- 
Hence by Lemma 6.2, fhe word Sx,a carmof have as a prefix a square of a word in RStand^(x). □ 

If follows fhaf if s has infinitely many solufions of (8) as square prefixes, fhen s G {01c«, lOca}. 
Nexf we fake one exfra step and characterize when s can be wriffen as a producf of squares of 
words in RStand~^(a). 

Theorem 6.4. A Sturmian word s of slope a can be written as a product of squares of words 
in RStand'^(a) if and only if s is not of the form ■ ■ ■ X^c where X; e RStand\x) and 

c G {OlCa, lOca}. If sis a product of squares in RStand^{x), then this product is unique. 

Proof. This is a direcf consequence of Proposition 6.3 and Lemma 6.2. □ 

Suppose fhaf s ^ {OlCa, 10c«}. Then fhe word s has only finifely many solufions of (8) as 
square prefixes. We call fhe longesf solufion maximal. Observe fhaf fhe maximal solution is nof 
necessarily primitive since any power of a solufion fo (8) is also a solufion. Sfurmian words of 
slope a can be classified info fwo fypes. 
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Type A. Sturmian words s of slope a which can be written as products of maximal solutions 
to (8). In other words, it can be written that s = ■ ■ ■ where X, is the maximal solution 

occurring as a square prefix of the word T^'is) where /z, = |XjX| ■ ■ ■ X^_j |. 

Type B. Sturmian words s of slope a which are of the form s = X^X^ ■ ■ ■ X^c where c G 
{OlCa, lOCff} and the words X, are maximal solutions as above. 

Proposition 6.3 and Lemma 6.2 imply that the words X; in the above definitions are uniquely 
determined and that the primitive root of a maximal solution is in RStand^[(x.). Consequently, a 
maximal solution is always right special. When finding the factorization of a Sturmian word as 
a product of squares of maximal solutions, it is sufficient to detect at each position the shortest 
square of a word in RStand^{a) and take its largest even power occurring in that position. 

Keeping the Sturmian word s of slope a. fixed, we define two sequences and (A/t). We set 
t'o = -^0 = £• Following the notation above, we define depending on the t 5 q)e of s as follows. 

(A) If s is of type A, then we set for all fc > 1 that 

= X^X^ ■ ■ ■ X^ and 
= X 1 X 2 ■ ■ ■ Xfc. 

(B) If s is of type B, then we set for 1 < k < n that 

in. = Xfxi ■■■Xl and 

A;t = X 1 X 2 ■ ■ ■ Xfc, 
and we let 

fin+i = X 1 X 2 ■ ■ ■ X^c and 

An+i = X 1 X 2 ■ ■ ■ X„c. 

Compare these definitions with the example in the begirming of this section; the words Xj 
and X 2 are maximal solutions in the Fibonacci word (which is of t 5 q)e A). 

We are finally in a position to formulate precisely the observations made in the begirming of 
this section and state the main result of this section. 

Theorem 6.5. Let she a Sturmian word with slope a. 

(A) Ifs is of type A, then 

\/i= lim rl^''l(s). 

k^co 

Moreover, the first occurrence of the prefix A/t+i of yfs is at position |Ajt| of s for all k >0. 

(B) Ifs is of type B, then 

Vs = ri^«i(s). 

Moreover, the first occurrence of the prefix Aj-+i with 0<k<n — lisat position |A/t| of s, and the first 
occurrence of any prefix of ^/s having lenght greater than \A„\ isatposition |A„| ofs. 

In particular y/s is a Sturmian word with slope a. 

The theorem only states where the prefixes Ajt of y/s occur for the first time. For the first 
occurrence of other prefixes of yfs we do not have a guaranteed location. 

To illustrate the theorem, consider next t, the eighth shift of the Fibonacci word. If we write 
under the word t each of the corresponding words at the position of their first occurrence we 
get the picture in Figure 2. Theorem 6.5 shows that the nice pattern where the words A/; overlap 
continues indefinitely and, moreover, that if we replace t with any other Sturmian word (of t 5 q)e 
A) we obtain a similar picture. Most of the results of this paper were motivated by the discovery 
of this pattern. 

Before proving the theorem we need one more result. 
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01001010010100100101001001010010100100101001010010010100100101001010010010100100 • • • 
010 j :i; t t t t t 

01010010 i i 

0101001001010010 i 4- 

01010010010100101001001010010 i 

01010010010100101001001010010010100101001001010010 


Figure 2: The first occurrences of the words Aj- in t. The eighth shift of the Fibonacci word was 
used since for the Fibonacci word the lengths | Aj^ \ grow very rapidly. 


Proposition 6.6. Suppose that s is a Sturmian word of type A. Then the word is right special and a 
suffix of the word pkfaf k>0. 

Proof This proof might be tricky to follow. We advise the reader to keep the picture of Figure 3 in 
mind while reading the proof. This picture depicts only the Case A below but is surely helpful. 

The assertion is evident when k = 0. Suppose that k > 0 and assume that Aj- is right special 
and that Aj^ is a suffix of the word It is equivalent to say that { —(|Aj-| + l)a} G [Aj-] and 
[Pk] ^ (evidently 2\A]^\ = \pk\)- We write simply A = A^-, p = pj^, and X = X;t+i- 

This proof utilizes only the facts that pX^ G that A is right special and a suffix of the 

word p, not the structure of the words A and p implied by their definitions. Thus without loss of 
generality, we may assume that X is primitive. Consequently, X G RStand~^{a). It follows that 

[X] = J( —(rj + !)«, 1 — a) and (9) 

[X2] = [X] n R-|X|([X]) = + l)a, -(|X| + 1 )«). 

for some nonnegative integer q. Let x = { —(ll^l + It follows from the hypothesis { —(|A| + 
1 )a:} G [a] that X G By (9) the point x is an endpoint of the interval R“ II ([X]). 

Let then y = {—(|yX| + l)a}. By (9) the point y is an endpoint of the interval R~ 
and an interior point of the interval R If'I ([X]). Suppose for a contradiction that y R |A| ([A]). 
As X fz R“lf'X|([X]) (otherwise it would follow that 1 — a G R“|x|([X]) which contradicts (9)), it 
follows that R“1 a|([A]) D R“lf'X|Qx]) = 0. Since 

[y,+i]-[y]nR-lf'l([X])nR-lf'X|([x]), 

we have that [pk+i] Q R~lf'X|([X]). By assumption [pk+i] Q [p] Q X~|a|([A]). Thus [pk+i] Q 
r-|a|([a]) nR-lf'Xl([x]), so by the above we are forced to conclude that [pk+i] = 0- This is 
a contradiction since X is chosen in such a way that [y^+i] = [pX^] 7 ^ 0. We conclude that 
yGR-l^l([A]). 

Now R-|a|([AX]) = R-|a|([A]) nR-lf'l([X]). SinceyG R-|A|([A]),R-lf'l([X]), it follows that 
y = { —(|yX| + l)a} G R“|a| ([AX]). Thus R|a (y) = { — (|AX| + l)a} G [AX], so the word AX is 
right special. We have two cases depending on the length of the interval R~lf'l ([X]) compared to 
the length of the interval R“ I A| ( [A] ) . 

Case A. R“lf'l([X]) ^ R“|a|([A]). In this case R~Ia|([AX]) = J(x,z) where z an endpoint of 
R“|a|([A]). Since y is an interior point of R“|a|([AX]), R“|x|(x) = y, and x ^ R“|AXl([[yx]), 
we obtain that t(y,z) C R~|a^I ([AX]). Since y is also an interior point of R~|a| ([A]), we obtain 
similarly that R~|A| ([A]) D R“lf'l([X^]) = t(y,z). Thus 

iFk+l] = M nR-tf'l([x2]) C R-|A|([A])nR-lf'l([x2]) = t(y,z) c R-|AXI([ax]). 

This proves that AX = Ajt+i is a suffix of y^t+i- 
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a: y 


[y] 

R-I"I([A]) 

R-M{[X]) 

R-lf^ldX]) 

K-I''I([AX]) 

R-I'^*I{[AX]) 

R-|'‘I([x 2 ]) 


Figure 3: A possible arrangement for the intervals in the Case A of the proof of Proposition 6.6. 
The blue color marks the interval [^X] and magenta marks the interval [}iX^] = 

Ffc-i Xj. Xj. Xi.+i Xjt+i 

I-^^-1-^-1 

w w w w 


I—I—^^—I 


I- 1 

^k-\ 


X, 


X, 


+ 


4- 


k+l 


Figure 4: Possible locations for factors in the proof of Theorem 6.5. 


CaseB. R-lf'l([X]) C R-l'^t([A]). It follows that R-I'^I([AX]) = R-lf'l([X]), so R-I'^^I([AX]) = 
R-|yX|([x]). Since R-I''I([X2]) C ([X]), we get that 

[h+i] = [F]nR-\f^\{[x^]) c [H]nR-l?'^l([x]) c [^] nR-l^^l([AX]) 

proving that also in this case AX = Aj-_|_x is a suffix of }ik+i- ^ 

Note that even though Aj- is right special and always a suffix of it is not necessary for to 
be right special. 

Proof of Theorem 6.5. Since Sturmian words of t 5 rpe B differ from Sturmian words of t 5 rpe A essen¬ 
tially only by the fact that the sequence of maximal solutions is finite, it is in this proof enough to 
consider the case that s is of type A. 

Proposition 6.6 says that Xp. is always a suffix of for all k > 0. Since = 2|Aj-|, it follows 
that the word (s) has the word Xp- as a prefix. Therefore ^/s = limj-_j.oo (s). 

It remains to prove that the first occurrence of Aj-+i in s is at position | Ajt | of s for all k > 0. It is 
clear that the first occurrence of Aj = Xp is at position |Ao| =0. Assume that k > 0, and suppose 
for a contradiction that Aj-+i occurs before the position \Xp\. Since Xp is a prefix of Xp^_i, by 
induction we see that Xp^i cannot occur before the position | Xp_i \. This means that an occurrence 
of XpXp_^_■^ begins in s at position v such that \}ip-i | < v < \}ip-iXp\; see Figure 4. Observe that s 
has at position \}ip-i \ an occurrence of X^. Write now Xp = with w e RStand^{a). Since w is 
primitive, we must have that v = + ^"l^l with 0 < r < t. Thus Xp^i occurs in s at position 

V + |X;tl = \Tk-i ! + ('' + t)\xv\- Since r < t, it follows that either ro is a prefix of X/t+i or X;t-i-i is a 
prefix of w. 

Suppose first that re is a prefix of Xp_f_i. lfw = Xp^p, then the prefix }ip_iX^ of s is followed by 
iv^. Now is a solution to ( 8 ) implying that Xp is not a maximal solution to ( 8 ). Since this is 
contradictory, we infer that |ro| < |Xjt+i|. Since Xp_^_l occurs atposition \}ip-i\ -!-()' + f)|w| < \}ip\ 
and X;t-i-i has w as a prefix, it must be that X;t-i-i begins with wa where a is the first letter of w. 
Since lu is right special and € £(a), it follows that X^^^ begins with lo^. Like above, this 
implies that Xp is not maximal. This is a contradiction. 
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Suppose then that X;t+i is a proper prefix of w. Firsf of all, musf be primitive, as ofh- 
erwise X^^+i and consequenfly w would have as a prefix a square of some word in RStand^{a.) 
confradicfing Lemma 6.2. The assumption that X/^+i is a prefix of w implies fhaf X/^+i and w 
begin wifh fhe same letter. Like above, since w is righf special and e £(a), if musf be fhaf 
w occurs after the prefix of s- Since also X^^^ occurs after fhe prefix by Lemma 6.2 we 
conclude fhaf fhe word w musf be a proper prefix of X^^^. Observe now fhaf fhe assumption 
fhaf Xjt+i is a proper prefix of w excludes fhe possibilities that if = sq = 0 or a; = ?i = 10 “. 
Therefore w = for some h > 2 with 0 < (. < a^. Because |if| < 2|Xj-+x|, we must have that 
|Xj-_,_x| > |sj!,_ 2 |. On the other hand, since |Xj-_|_x| < |if| and Xj-_(.x and w begin with the same 
letter, the only option is that Xj-^x = with 0 < f" < f. Now 

Xf+X = {Sh-2Sh-lf = Sh-2Sh-l^{\-l)Sh-2ShIi, 

SO as If is a prefix of X^^^, if musf be fhaf Sj!,_x = L(sj!,-i). This is a confradicfion. This final 
confradicfion ends fhe proof. □ 

As a conclusion of fhis secfion, we sfudy fhe lengfhs of fhe maximal solutions of ( 8 ). Namely, 
lef s = X^X| ■ ■ ■ be a Sfurmian word of fype A facforized as a producf of maximal solutions 
X,. Compufer experimenfs suggesf fhaf fypically fhe sequence (|X, |) is sfricfly increasing. How¬ 
ever, fhere are examples where |X, | > |X,_|_x| for some z > 1. If is nafural fo ask if fhe lengfhs 
can decrease significanfly or if oscillation is possible. If fums ouf fhaf neifher is possible. In 
Corollary 6.9 we prove fhat liminf,_j.oo |X, | = oo. 

Firsf we need a resulf on cerfain periods of (semi)sfandard words. 

Lemma 6.7. Let u,v E Stand^{oc) and \u\ > |iz|. Ifu is a prefix of some word in iz+, then u = s^^f and 
V = s^_ifor some k>2 with 0 < I < a^. 

Proof. Suppose fhaf u is a prefix of some word in iz+. If iz = sx = 0“1, fhen necessarily iz = Sq = 0. 
Then obviously u is nof a prefix of any word in iz'*'. Therefore u = s^/ for some k > 2 wifh 
0 < £ < Suppose fhaf k = 2. Then u = (0“1)^0. If is sfraighfforward fo show fhaf v musf 
equal fo Sx = 0 “ 1 ; m cannof be a prefix of a word in iz+ if iz = sq = 0 or iz = $2 gi for some £' such 
fhaf £)<£'<£. Thus we may assume fhaf k > 2. 

Suppose firsf fhat |iz| > |s/t_x|. Then by the assumption \ii\ > |iz|, it must be that v = Sj^^p 
for some £' such fhat £' < £. Since zz is a prefix of some word in iz+, it follows fhaf fhe word 
w = Sj ^_2 is a prefix of some word in S]f_ 2 V~^. Since fhe word w begins wifh Si-_xSi^_ 2 , we 

obfain fhaf Sj(_ 2 V begins wifh sif_iSif_ 2 , so Sif_iSif _2 = Si^_ 2 Si-_x. This is a confradicfion. 

Assume fhen fhaf |iz| < |Sjt_i|. Now fhe prefix sjt-x of zz is a prefix of some word in iz"*", so 
by induction v = Si-_ 2 . Now u = so as zz is a prefix of some word in iz+, it 

follows fhat z = S]f_^Sjf _2 is a prefix of some word in iz"*". This means fhat z ends with a prefix 
of Sk -2 of lengfh |si:_ 3 |. As fhe prefix of s ^-2 of lengfh |sjt_ 3 | is Sfc_ 3 , fhe word z ends wifh 
Consequenfly Sjt_ 3 S;t -2 = Sk-2^k-3i ^ confradicfion. 

The only remaining option is fhaf iz = This is cerfainly possible. □ 

The nexf proposition describes precisely under which conditions it is possible that |X, | > 

I X,_|_x I. Moreover, it rules out the possibility that the lengths decrease significantly or oscillate. 

Proposition 6 . 8 . Let s = X^X^Xg ■ ■ ■ be a Sturmian word of type A with slope a factorized as a product 
of maximal solutions X;. If \Xi\ > |X 2 |, then Xx = Sk,e for some k > 2 with 0 < £ < a^ — 1, the 
primitive root of X 2 is Sk-\, and \X 2 ,\ > |Xx|. 

Proof. Assume that |Xx | > IX 2 I. Let us first make the additional assumption that Xx is primitive. 
In particular, Xx E RStand^{oc). Let u be the primitive root of X 2 . Then zz E RStand'^{a) and, 
moreover, by fhe assumption I Xx I > IX 2 I it holds that |zz| < |Xx|. By Proposition 6.6 the word 
A 2 = XxX 2 is a suffix of fhe word p 2 = X^X^. Therefore Xx is a proper suffix of XxX 2 , so 
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XiX^ = ZXi for some nonempty word Z. A standard argument shows that Xi is a suffix of some 
word in X^ (see e.g., [10, Proposition 1.3.4]). Consequently, Xj is a prefix of a word in u~^. As 
\u\ < |Xi |, Lemma 6.7 implies that that Xj = and u = for some k >2 with 0 < i < aj^. 
Suppose now that i = aj-. Then the word X^X^ contains ^ factor. Thus 

e £{oc). As Sk-i is a prefix of Sk- 2 Sk-i> h follows that e £{oc) contradicting 
Proposition 2.7. Therefore i < aj^ — 1- 

Let us then relax the assumption that Xi is primitive. Let v be the primitive root of Xi, so 
that Xi = for some j > 1. Consider now the Sturmian word = v^X^ ■ ■ ■ ■ By the 

above arguments v = Sk(, for some k > 2 with 0 < t? < flj- — 1 and the primitive root of X 2 is S(t_i. 
Further, as i ^ Uk, it follows from Proposition 2.7 that i £{dl). Thus ] = 1, that is, Xj = Jk,e- 
It remains to show that IX 3 I > |Xi|. Assume for a contradiction that IX 3 I < |Xi|. It is 
not possible that | X 3 1 < | X 2 1 as the preceding arguments show that then X 2 must be reversed 
semistandard word; however, X 2 is a power of the reversed standard word 'Sk-i- Hence by the 
maximality of X 2 we have that IX 3 I > |X 2 |. Let X 3 = with w e RStand^{a) and f > 1. As 
l^il < l^sl < |Xi|, we have that |sj-_i| < t\w\ < 

Assume for a contradiction that | < |sj-_i|. If w is semistandard, then Proposition 2.7 im¬ 
plies that t = 1, so t\iu\ > |sj:_i| carmot hold. Thus zv is standard. If a; = sq =0, then clearly 
t\iv\ > I Sk- 1 1 > I Si I carmot hold as the index of the factor 0 in £ (a) is a -|-1. Thus ro 7 ^ sq . Suppose 
first that zv = Sk- 2 - Now 

t\zv\ > = ak_i\Sk- 2 \ + |Sjt- 3 |, 

so t > a^_i. Since X 3 G £{(£), Proposition 2.7 implies that 2t < -H 2. Therefore 

dk-i -|- 2 > 2f > 2flj-_i 

implying that fl/t_i = 1. However, if Uk-i = 1, then Uk-i -|- 2 is odd, so actually 2t < Uk-i -h 2. 
Then %-i + 2 > 2t > 2a^_i, so a^_i < 1; a contradiction. Suppose then that zv = Sk- 2 ,- Now 

t\zv\ > |Sfc_i| > |Sfc_ 2 S;t_ 3 | = \sfs^Sk-iSk-3\ > (%-2 + 1) Is/t-sl 

SO t > a]f _2 -|- 1. Like previously, as X 3 G £{oi), Proposition 2.7 implies that 2t < af ^_2 + 2. Like 
above, we obtain that flj -_2 < 0; a contradiction. Similar to above 

|S;:_l| > {ak-2 + l)|Sj:_3| + |Sfc-4l > 2|sj-_3| -|- |Sfc_4| > (2fljt-3 + l)|Sfc-4|- 

As 2 flj -_3 -h 1 > fli :_3 + 2, we conclude that < |s;t_i|. Therefore by Proposition 2.7 it 

is not possible that |a;| < |si-_ 4 |. In conclusion, it is not possible that t\zv\ > |sj-_i|. This is a 
contradiction. 

Now |m;| > |?;t_i I (by the maximality of X 2 it must be that zv 7 ^ sjt-i)- Because |a;| < \sk ,£\/we 
have that zv = Jk/' for some i' such that 0 < £' < Z. Since ^ 7 ^ a^, the word zv is semistandard so 
by Proposition 2.7 we have that f = 1. By Proposition 6.6 the word A 3 = X 1 X 2 X 3 is a suffix of the 
word ]i 3 = X 3 X 2 X 3 . It follows that Jk-iSk-i ~ where r is such that X 2 = 

Therefore the words %-2 ^rid %-i commute; a contradiction. This final contradiction proves that 
IX 3 I > |Xi|. □ 

Corollary 6.9. Let s = X^X^ ■ ■ ■ he a Sturmian zvord of type A zvith slope a factorized as a product of 
maximal solutions X,. Then liminf,_>oo |N;| = 00 . 

Proof. This follows from Proposition 6 . 8 : if |X,-^i| < |X,| for some f > 1, then |X,_|_ 2 | > |X, |. □ 
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7 The Square Root of the Fibonacci Word 


In this section we prove a formula for fhe square roof of fhe Fibonacci word. To do fhis we 
factorize fhe Fibonacci word as a producf of maximal solufions to ( 8 ). 

We denote by O fhe slope of fhe Fibonacci word, fhaf is, O = [0; 2,1,1,...]. Furfher, we sef 


h 


01 , if k is even, 
10 , if k is odd . 


We need two lemmas specific fo fhe slope O. 

Lemma 7.1. For the standard words of slope O it holds that tj^SkSk+iSk+i = k >0. 

Proof. The case fc = 0 is verified direcfly: foSoSiS2 = 01 ■ 0 ■ 01 ■ 010 = (010)^ ■ 10 = sf h- Let then 
fc > 1. There exists a palindrome such that sj^ = Pj^^tj^- for all > 1 (see e.g. [11, Lemma 2.2.8]). 
Now 


lkSkSk+lSk+2 — lkPkhSk+lSk+2 — SklkPk+lh+lSk+2 — Sktk+lPk+ltk+lSk+2 
= SkSk+ltk+lPk+2lk+2 = SkSk+lSk+2lk+2 = Sk+2^k+l> 

which proves the claim. □ 

Lemma 7.2. For the standard words of slope O it holds that s^^+i = Y\ 4 =o^ii +2 ' ^k+ifa^ k> 0. 
Proof If A: = 0, then S 4 = 01001010 = s^ti- Let then A: > 1. Now 

S3k+i = S3k+3S3k+2 = S3k+2S3k+1^3k+2 = S3k+lS3kS3k+1^3k+2 

k-1 

= S3{k-l)+i^3kS3k+lS3k+2 = JT ^3i+2 ' tkS3kS3k+1^3k+2 

i=0 

where the last equality follows by induction. By applying Lemma 7.1 we obtain that 

k-l k 

S3k+4: = n ®3!+2 ■ ^3k+2^k+l = ri®3!+2 ' 

;=0 i=0 

which proves the claim. □ 

As an immediate corollary to Lemma 7.2 we obtain a formula for the square root of the Fi¬ 
bonacci word. 

Theorem 7.3. For slope O we have that 

00 CO 

^‘t’~n® 3!+2 = Si ,J, = J 3 s 3 ,+ 2 - 

1=0 ! = 0 

The preceding arguments are very specific to the Fibonacci word. The reader might wonder 
if formulas for the square roots of other standard Sturmian words exist. Surely, for some specific 
words such formulas can be derived, but we believe no general factorization for the square roots 
of standard Sturmian words can be given. Let us give some arguments supporting our belief. 

Let s = ■ ■ ■ be a standard Sturmian word of slope a factorized as a product of maximal 

solutions to (8). The word s begins with the word 0“1. Therefore if a > 1, then Xi = . Thus 

if a > 1, then X 2 begins with 0 if and only if a is odd. Because of the asymmetry of the letters 
0 and 1 in the minimal squares of slope a ( 1 ), the parity of the parameter a greatly influences 
the remaining words X;. Moreover, it is not just the partial quotient ai which influences the 
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^4 

1 

2 

3 

1 

2,5,8 

2,4,7 

2,5,8 

2 

2,3,6 

2,4,7 

2,3,6 

3 

2,3,5 

2,4,7 

2,3,5 


Table 1: How Xj, X 2 , and X 3 are affected when fl 3 and 04 vary in the case that ai =2 and fl 2 = 1- 


^3 

1 

2 

3 

1 

1 

0 

1 

2 

1 

0 

1 

CO 

0 

0 

0 


Table 2: How the first letter of X 4 varies when and 04 vary in the case that aj = 2 and ^2 = 1- 


factorization. Suppose for instance that ai = 2 and = 1- Table 1 shows how the values of 
the partial quotients 03 and 04 affect the words X,. The cell of the table tells to which squares 
of reversed standard words the words Xi,X 2 and X 3 correspond to. For example if 03 = 2 and 
fl 4 = 1, then the standard Sturmian word of slope [ 0 ; 2,1,2,1,...] begins with S 7 . Table 2 tells 
the first letter of the corresponding word X|. As can be observed from Table 2, the first letter of 
X| varies when and vary. Because of the asymmetry it is thus expected that slight variation 
in partial quotients drastically changes the factorization as a product of maximal solutions to ( 8 ). 
Since similar behavior is expected from the rest of the partial quotients, it seems to us that no 
nice formula (like e.g., the formula of Theorem 7.3) can be given for the square root of a standard 
Sturmian word in terms of reversed standard words. 

De Luca and Fici proved a nice formula for a certain shift of a standard Sturmian word [4, 
Theorem 18]. 

Proposition 7 . 4 . Let c« be the standard Sturmian word of slope a. = [ 0 ; o + 1 , b + 1 , ...]. Then 

CO 

c„=o“io“-inst- 

k=l 

As a corollary of this theorem we obtain that the word is a Sturmian 

word of slope a with intercept f>{{ (2a + l)a}) = aa. We have thus shown that 

CO 

Ccc= 

In particular, we obtain the well-known result that the Fibonacci infinite word is a product of the 
reversed Fibonacci words. 

8 A Curious Family of Subshifts 

In this section we construct a family of linearly recurrent and optimal squareful words which are 
not Sturmian but are fixed points of the (more general) square root map. Moreover, we show that 
any subshift Q generated by such a word has a curious property: for every ic € Q either ^/w £ O 
or -y/a; is periodic. 

It is evident from Proposition 2.2 that Sturmian words are a proper subclass of optimal square¬ 
ful words. As Sturmian words have the exceptional property that their language is preserved 
under the square root map, it is natural to ask if other optimal squareful words can have this 
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property. We show that, indeed, such words exist by an explicit construction. The idea behind 
the construction is to mimic the structure of the Sturmian words OlCa and lOCa. The simple reason 
why these words are fixed poinfs of fhe square roof map (fhus preserving the language) is that 
they have arbitrarily long squares of solufions fo (8) as prefixes. Thus fo obfain a fixed poinf of 
fhe square roof map, if is sufficienf fo find a sequence {ui^) of solufions fo (8) wifh fhe properfy 
fhaf is a proper prefix of for all A: > 1 . Lef us show how such a sequence can be obfained. 

Lef S be a fixed primifive solufion to (8) in the language of some Sfurmian word wifh slope 
[ 0 ; a + 1 , b + 1 ,...] such fhaf |S| > [Sej. In parficular, S has fhe word Se = as a 

proper suffix. Recall from fhe proof of Lemma 5.6 fhaf |S| > IS5S5I. We denofe fhe word L(S) 
simply by L. Using fhe word S as a seed solufion, we produce a sequence (7^^) of primifive 
solufions fo (8) defined by fhe recurrence 

7 i = S, 7 ,t+i = U 7 k) 7 k for ^ ^ 2 . ( 10 ) 

We need fo prove fhaf fhe sequence (7^^) really is a sequence of primifive solufions fo (8). Before 
showing fhis, lef us define 

Ti = lim 72fc and T2 = lim 72^+1. (11) 

k^oo /c—>-oo 

The limifs exisf as 7^ is always a prefix of Jk+i- Hence bofh Ti and r2 have arbifrarily long 
squares of words in fhe sequence {jjf) as prefixes. Observe also fhaf C{Yi) = C{T2)- As fhere is 
nof much difference between T i and r2 in ferms of sfrucfure, we sef T fo be eifher of fhese words. 

Taking for granfed fhaf fhe sequence (qjt) is a sequence of solufions fo (8), we see fhaf -v/T = T. 
Nofe fhat we also need fo ensure fhaf fhe word T is opfimal squareful for fhe square roof map fo 
make sense. 

Nexf we aim fo prove fhe following. 

Proposition 8 . 1 . The ivord 7^- is a primitive solution to (8) in £(o, b)/or all k> 1 . 

Recall from Secfion 5 fhaf fhe language £(a, b) consisfs of all factors of fhe infmife words in 
the language 

(10a+l(ioa)b ^ ^ ^ 5^)0,^ 

Before we can prove Proposifion 8 . 1 , we need fo know fhaf fhe words 7^^ are primifive and 
fhaf fhey are facfors of some opfimal squareful word wifh paramefers a and b. 

Lemma 8.2. The ivord is primitive for allk> 1 . 

Proof. We proceed by induction. By definition 71 is primitive. Let A: > 1 , and suppose for a 
confradicfion fhaf 7^+1 is nof primifive; fhaf is, 7^+1 = = z” for some primifive word 

z and n > 1 . If n = 2 , fhen obviously |7j-| musf be even, and fhe suffix of 7j- of lengfh |7i^|/2 
musf be a prefix of This confradicfs fhe primifivify of The case n = 3 would clearly imply 
fhaf 7i: = Lfyfj, which is nof possible. Hence n > 3 , and furfher |z| < {ykl- As 7^ is a suffix of 
some word in , if follows fhaf z = uv where vu is a suffix of 7^^. On fhe ofher hand, z is a suffix 
of 7 j^, so uv = vu. Since z is primitive, the only option is that u is empty. Therefore 7^^ G z“*"; a 
confradicfion wifh fhe primitivify of 7^-. □ 

Lemma 8 . 3 . We have that L(7j-) e £(a, b)/or all k> 1 . 

Proof. For a suifable slope a = [ 0 ; a + 1 , b + 1 ,...], eifher of fhe words S and L is a reversed 
sfandard word of slope a. Thus by Theorem 5.2 bofh and are in £ (a), so S^, if e £(a, b). 

We clearly have fhaf 71 e £ ( a, b). Nofe fhaf by fhe assumpfion | S | > | Se | bofh of fhe words S 
and L have fhe word s = Se = 10 “+^ ( 10 “)*’"^^ as a proper suffix. Write S = us. Since s begins with 
10 “"''^ and has sms as a suffix, it follows fhaf us e (S5 + Se)'*’. Using the fact that L e £(a, b). 
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we see that 72 = LSS = L{u)s{us)^ e £(o, b). Clearly 1 ( 72 ) = = (us)^ E £(a, b). Proceeding 

by induction we may assume that k > 2 and 7 ^-, L( 7 /c) E £(a, b). Since 7 ^- has either S or L as a 
prefix, it can be written that = vszs with |to| = |S|. It follows fhat sz E (S5 + Se)'*’. Since si^s is 
a suffix of eifher or L^, we have fhaf sv E (S5 + Se)'*'. Therefore si^sz E (S5 + Se)'*'. As 1(7*:) = 
L(vsz)s E £(a, b), we have fhaf L{vsz) is a suffix of some word in (S5 + S6)+. Overall, fhe word 
7 *:+! = L{vsz){svsz)^s is in £(a, b). Clearly fhen musf fhe word L( 7 *;+i = {vszs)^ = vsz{svsz)^s 
also be in £(a, b). □ 

Nofe fhaf wifhouf fhe assumption |S| > |S 6 | fhe conclusion of fhe above lemma fails fo hold. 
If S = Sg = 10“+i(10“)‘’+i, fhen L = 0(10“)‘’+2 and IS = 0(10“)‘’+2l0“+i(10“)‘’+i. Therefore 
LS ^ £(a, b), and consequently 72 = LS^ ^ £(a, b). 

Proof of Proposition 8.1. We proceed by induction. By Lemma 8.2 the word 7 *- is primitive for all 
fc > 1. Lemma 8.3 fells fhaf bofh of fhe words 7 * and £( 7 *) are in £(a, b) for all k > 1. By 
definition bofh 71 and L(ji) are solutions fo ( 8 ). We may fhus assume fhat k > 1 and both 7 *- 
and are solutions to ( 8 ). It follows from Lemma 5.6 fhaf 

7 /ci( 7 fc) e n(a, b) and = 7k- 

Since L{7k) is a solufion fo ( 8 ), Lemma 5.6 also implies fhaf 

i'( 7 ;c) 7 fc e n(a,b) and L{7k)7k = 

Because 

7k+i = Hikhk ■ jkHik) ■ jI 

we obfain thaf 

ll+i E n(a, b) and = \/L{7k)7k\/7kU7k)^/li = k{7k)7k7k = 7k+i- 

This proves fhaf 7k+i is a solufion fo ( 8 ). Consider nexf fhe word L( 7 *-.gi) = 7 ^. Because 
(L( 7 jt+i))^ = ( 7 ^)^/ it is evidenf fhaf 

(i'(7;c+i))^ e n(o, b) and {L{7k+i)Y = 7k = H7k+i)- 
Therefore also L{7k+i) is a solufion fo ( 8 ). The conclusion follows. □ 

As we remarked earlier, we have now proved fhaf T is a fixed poinf of fhe square roof map. 
Next we show that the word T is aperiodic, linearly recurrent, and not Sturmian. 

Lemma 8 . 4 . The word 7 ^ is not a factor of any Sturmian word. 

Proof. By definition 72 = LS^. Write S = xyw and L = yxw for some word w and disfincf leffers 
X and y. Now ^2 = xyxw{xywYyxw{xywY, so fhe word 72 has factors xwx and ywy. Hence 72 
is nof balanced, and if carmof be a factor of any Sfurmian word. □ 

Lemma 8 . 5 . The word T is aperiodic and linearly recurrent. 

Proof. The recurrence (10) and fhe definifion (11) of T show fhat for all fc > 1 fhe word T is a 
producf of fhe words 7k+i = ^{7k)7k ^nd L( 7 j;+i) = 7 ^ such fhaf befween fwo occurrences of 
L( 7 *_|_i ) fhere is always 7 ^ or 7 ^. From fhis if follows fhaf fhe refum fime of a factor of T of lengfh 
7 * is af mosf fhe refum fime of fhe factor L( 7 *), which is af mosf 6 | 7 *:|. Lef fhen a; be a factor of 
T such thaf | 7 *-| < |h;| < | 7 *-+i|. Since ro is a factor of some factor of T of lengfh | 7 *-+i|, if follows 
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that the return time of w is at most 6 | 7 i^_|_i|. Now 6 | 7 i-+i| = 18|7jt| < 18|w| proving that T is 
linearly recurrent. 

The preceding shows that is followed in £(r) by bofh 7 ^^ and L{'yj^). As fhe firsf letters of 
7 jt and are disfincf, fhe facfor 7 j- is righf special. Thus £(r) confains arbifrarily long righf 
special factors, so T musf be aperiodic. □ 

Since linearly recurrenf words have linear facfor complexify [5, Theorem 24], if follows from 
Lemma 8.5 fhaf T has linear facfor complexify. 

We observed in fhe previous proof fhaf fhe word T is a producf of fhe words S and L such 
thaf befween two occurrences of L in fhis producf fhere is always or S®. Since S and L are 
primifive, any word w G £(r) which is a producf of fhe words S and L such fhaf |w| > 6 |S| 
musf S 5 mchronize fo fhe factorization of T as a producf of fhe words S and L. Thaf is, for any 
facforizafion T = uwT' we musf have fhaf | m | is a mulfiple of | S |. 

Theorem 8.6. The word T is a non-Sturmian, linearly recurrent optimal squareful word which is a fixed 
point of the square root map. 

Proof. The facf fhat T is opfimal squareful and linearly recurrenf follows from Lemmas 8.3 and 8.5. 
The argument outlined at the beginning of fhis section shows fhaf T is a fixed poinf of fhe square 
roof map as by Proposition 8.1 fhe words 7 ^^ which occur as square prefixes in T are solufions fo 
( 8 ). Finally, T confains fhe facfor 7 ^, so T is nof Sfurmian by Lemma 8.4. □ 

Denote by fl fhe subshiff consisting of fhe infinife words having language £(r). As T is 
linearly recurrent, it is uniformly recurrenf, so fhe subshiff fl is minimal. The resf of fhis secfion 
is devoted fo proving fhe resulf menfioned in fhe beginning of fhis secfion. 

Theorem 8 . 7 . For allwED. either ^/w E D. or ^Jw is (purely) periodic with minimal period conjugate 
to S. Moreover, there exists words u,v E d such that -Ju E O and -Jv is periodic. 

This resulf is very surprising since if is confrary fo fhe plausible h 5 rpofhesis fhaf an aperiodic 
word musf map fo an aperiodic word under fhe square roof map. 

If is nof difficulf fo prove Theorem 8.7 for words in O which are producfs of fhe words S and 
L. We prove fhis special case nexf in Lemma 8 . 8 . However, difficulties arise since a word in Ci 
can sfarf in an arbifrary posifion of an infinife producf of S and L. There are cerfain well-behaved 
positions in S and L which are easier fo handle. Theorem 8.7 is proved for fhese special positions 
in Lemma 8.10. The resf of fhe efforf is in demonsfrating fhaf all fhe ofher cases can be reduced 
fo fhese well-behaved cases. We begin by proving fhe easier cases, and we conclude wifh fhe 
reducfions. 

Lemma 8.8. If a word w E Cl can be written as a product of the words S and L, then y/w E O. 

Proof. Any word u which is a producf of fhe words S and L can be nafurally written as a binary 
word u over fhe alphabef {S, L}. If such a word u has even lengfh, then it is a word over the 
alphabet A = {SS, SL, LS, LLj. Using the fact that \/^ = S, = S, \/LS = L, and \/LL = L 
(see Lemma 5.6), we can define a square roof for a word over A. 

The word 7 ^ is a prefix of T for all A: > 1. Thus 7 /t has occurrences af positions 0 and | 7 j^| of T. 
Clearly | 7 i^| = 3*^~^|S|, so fhe word 7 j. occurs in T in an even and in an odd posifion. 

Lef vhea prefix of w of lengfh |i;| = 2n\S\ for some n > 1, so U is a word over A. Since v is 
a prefix of w, fhe word c is a facfor of some Since occurs in T in an even and in an odd 
posifion, fhe word v occurs in an even posifion in T. Hence T can be factored as T = zvt where z 
and t are finife or infinife words over A. Since T is a fixed poinf of fhe square roof map, we have 
fhaf T = sfzs/TsTt. Hence y/v G £(r). It follows fhat £{y/w) C £(r), so y/w E Cl. □ 
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Definition 8.9. Let a; be a word and (. be an integer such that 0 < (. < \w\. li the factor of of 
lengfh I vip -1 sfarfing af posifion (. can be wriffen as a producf of minimal squares X^,..., X^, fhen 
we say fhaf fhe posifion ^ of rc is repetitive. If in addifion |Xj ■ ■ ■ X^,| 7 ^ |w| — 4 ^ for all m 

such fhaf 1 < m < n, fhen we say fhaf fhe posifion £ is nicely repetitive. 

For example if a = 1, b = 0, and S = 1001001010010, fhen fhe posifion 1 of S is repefifive 
as fhe factor 00100101001010010010100101 of of lengfh |S^| =26 sfarfing af posifion 1 is in 
n(a,b). This posifion is nof nicely repefifive as |0^ ■ (10010)^| = 12 = |S| — 1. The posifion 2 of S, 
however, can be checked fo be nicely repefifive. The posifion 4 of S is nof repefifive as fhe factor 
00101001010010010100101001 of lengfh 26 sfarfing af posifion 4 is nof in n(o, b). 

In fhe upcoming proof of Theorem 8.7 we will show fhaf if w G O is a producf of fhe words S 
and L and ^ is a nicely repefifive posifion of S, fhen fhe word is always periodic. On fhe 

ofher hand, we show fhaf if i is nof a nicely repefifive posifion fhen ^T^{w) is always in O. 

Nexf we identify some good positions in fhe suffix Se of S. As we observed in fhe proof of 
Lemma 5.6, fhe suffix S(, of S resfricfs locally how a factorization of a word as a producf of mini¬ 
mal squares continues affer an occurrence of S 5 . Consider a producf Xj ■ ■ ■ X^ of minimal squares 
which has an occurrence of Se af posifion £. Then for some m G {1,..., n} fhe minimal square 
X^ musf begin af some of fhe posifions £,£ + 1,... ,£ + \S(,\ — 1. Ofherwise some minimal square 
would have S 5 as an inferior facfor; yef no such minimal square exisfs. Among fhe posifions 
£,£ + 1, ...,£+ \ S(,\ — 1 we are inferesfed in fhe largesf posifion where a minimal square may 
begin. Lef 

B = {£ ^ {0,..., iSel — 1}: no square of lengfh af mosf [Se] — £ begins af posifion £ of Se}. 

If is sfraighfforward fo see fhaf 

B = {|S6| - iSel, |S6| - IS 4 I, iSel - IS 3 I, iSgl - |Si|}. 

We are inferesfed in fhose posifions of fhe suffix S 5 of S where no minimal square begins. Hence 
we define 

Bs = {£-.£-\S\ + |S 6 | eB} = {|S| - jSel, |S| - IS 4 I, |S| - IS 3 I, |S| - |Si|}. 

A consequence of fhe definifions is fhaf if ^ is a posifion of S such fhaf £ ^ Bs, then there exists 
£' G Ss U {|S|} such that S[£,£' — 1\ G n(a, b). This fact is used later several times. 

Lemma 8.10. Suppose that ro G fl can be written as a product of the words S and L. Assume that the 
position £ E Bs is nicely repetitive. Let the prefix ofT^{w) of length |S^| be factorized as a product of 
minimal squares Xf ■ ■ ■ X^. Then the word y'TUw) is periodic with minimal period Xi ■ ■ ■ X„. Moreover, 
Xi ■ ■ ■ X„ is conjugate to S. 

Proof Sketch. As £ is repetitive, the factor u of lengfh |S^| of sfarfing af posifion ^ is in n(a, b). 
If we subsfifufe fhe middle S in wifh L, fhen an applicafion of Lemma 5.5 shows fhaf fhe facfor 
of lengfh |S^| of SLS sfarfing af posifion £ is still in n(a, b) and fhaf fhe square roof of fhis facfor 
coincides wifh fhe square roof of u (here we need fhaf £ G Bs). Furfher analysis shows fhaf if we 
subsfifufe fhe words S in in any way, fhen fhe square roof of fhe facfor of lengfh | | beginning 

af posifion £ is unaffecfed. Since £ is repefifive, fhe prefix of of lengfh |S^| is again 

in n(a, b) and has fhe same square roof, and so on. Thus yjT^jw) is periodic. Since bofh fhe 
square of fhe period and occur in a suifable Sfurmian word; having equals lengfhs, fhey musf 
be conjugate by Proposifion 2.6. □ 

Proof. We have fhaf |S| > jSsSe], so ^ > 1. Lef u be fhe suffix of S of lengfh |S| — £. Since £ is 
repefifive, fhe facfor v of of lengfh | S^ | sfarfing af posifion £ can be factorized as a producf of 
minimal squares ■ ■ - Y^. We have fhaf \Y^\ > \u \ because £ E Bs. 
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Next we consider how the situation changes if any of fhe words S in is subsfifufed wifh L. 
Subsfifufing the first S with L does not affect the product as ^ > 1. Suppose then that the second 
word S is substituted with L. By applying Lemma 5.5 to the words u and S with X = Yj, we 
see that the factor of lengfh |S^| of SLS sfarfing af position £ can still be factorized as a producf 
of minimal squares and fhaf fhe square roof of fhis facfor coincides wifh fhe square roof of v. 
Consider nexf whaf happens when fhe fhird word S is subsfifufed wifh L. Lef 

r = max{f e {1,... ,m}: \Y^ ■ ■ ■ Yf\ < |S^| — £}. 

Sef £' = i + \Y^ ■ ■ ■ Y^l — |S|. Since £ is nicely repetitive, we have fhaf £' < | S |. By fhe maximalify 
of r and fhe definifion of fhe sef 3$, we fhus have fhaf £' € Applying Lemma 5.5 fo fhe suffix 
of S of lengfh | S | — £' and S wifh X = Y^+i we obfain, like above, fhaf fhe producf of minimal 
squares is affecfed buf fhe square roof is nof. Subsfifufing fhe second and third words S with L 
gives the same result: first proceed as above and substitute the second word S and then make the 
second substitution like above but apply Lemma 5.5 for fhe word L instead of S. 

We have concluded fhaf however we subsfifufe fhe words S in S^, fhe square roof of fhe facfor 
of lengfh |S^| begirming af posifion £ never changes. The word w is obfained from fhe word S'^ 
by subsfifufing some of fhe words S wifh L. By fhe preceding, fhe prefix of T^{w) of lengfh |S^| 
can be facforized as a producf of minimal squares Xj ■ ■ ■ X^. Since £ is repefifive, fhe prefix of 
T^+\s^\ (^zv) of lengfh |S^| can also be facforized as a producf of some minimal squares (perhaps 
differenf) buf fhe square roof still equals Xi ■ ■ ■ X„. By repeating fhis observation we see that 

By our choice of S we have fhaf S e {?j^, L(s]()} where sjt is a reversed sfandard word of some 
slope a = [0; a + 1, b + 1,...]. Lef j5 = [0; bi, ^ 2 ,...] be a number such fhaf = b,- for 1 < i < k 
and bjt+i > 5. Then by fhe definifion of sfandard words G By fhe preceding, fhe prefix 

of T^(S^) of lengfh |S^| can be written as a producf of minimal squares, and fhe square roof of 
these minimal squares equals (Xi ■ ■ ■ X„)^. Since the square root of a Sfurmian word of slope fi 
is a Sfurmian word of slope /3, we have fhaf (Xi ■ ■ ■ X„)^ G B{^). As |Xi ■ ■ ■ X„| = |S|, if follows 
by Proposition 2.6 fhaf Xj ■ ■ ■ X„ is conjugate fo S. Since S is primifive, so is X^ ■ ■ ■ X„, and hence 
fhe period Xj ■ ■ ■ X„ is minimal. □ 

Lemma 8.11. Every seed solution S has at least one nicely repetitive position £ such that £ & 8$- 

Proof. Suppose fhaf S = sjt; for some k>3 and 0 < i < It is suffidenf fo show fhaf r = \sk,i-i I 
is a nicely repefifive posifion of S. If r ^ 8$, then there exists r' G 8$ such that S[r,r' — 1] G 
n(a, b). Since the position r is nicely repetitive, so must r' be. If S = L(?j-,), fhen as r > 1, an 
applicafion of Lemma 5.5 shows fhaf fhe conclusion holds also in fhis case. 

Observe fhat fhe word % is bofh a prefix and a suffix of S. Using fhe facf fhat S/t_ 2 Sj -_3 = 
L(?s-_ 3 sjt_ 2 ) we obfain fhaf 

■5 ^k,i—l^k—l^k—2^k—l^k,i ^k,i—l ' ^k—l^k—2^k—3^l(—2 ' ^k,i—l^k,i 

= Sk,i-1 ■ Sk-lE{Sk-l) ■ 

By Lemma 5.6 fhe word Sj-_iL(sj-_i) is in n(a, b). Since ?ki-i is a solution fo ( 8 ), we have fhaf 

G TI(a, b). Overall, fhe facfor of of lengfh |S^| sfarfing af posifion r is in 

n(a, b). Thus fhe posifion r of S is repefifive. 

Suppose for a confradicfion fhaf fhe suffix of S of lengfh |S| — r is in TI(a, b), fhaf is, S = 
Sjt^z-iXj ■ ■ ■ X^ for some minimal square roofs Xj. It follows fhat Sjt_i = Xj ■ ■ ■ X^. Since Sjt-i 
is a solution to (8), it follows fhaf Sj^-i = (Xj ■ ■ ■ X„)^. This confradicfs fhe primifivify of 
Similarly if the suffix of of lengfh |S^| — r is in TI(a, b), fhen S/t,!-! G n(a, b) contradicting the 
primitivity of sjt i-i- conclude fhaf fhe posifion r is nicely repefifive. □ 
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Lemma 8.10 and Lemma 8.11 now imply the following: 

Corollary 8.12. There exist uncountably many linearly recurrent optimal squareful words having 
(purely) periodic square root. 

Proof. We only need to show that there are uncountably many such words. Consider the words 
in n which can be written as a product of the words S and L. Viewed over the binary alphabet 
{S, L}, these words form an infinite subshift O. Let us show that O is minimal. Then the con¬ 
clusion follows by well-known arguments from topology: a minimal subshift is always finite or 
uncountable and an aperiodic subshift cannot be finite (use the fact that a perfect set is always 
uncountable). 

Let w E Ci (we use the notation of the proof of Lemma 8 . 8 ). Let u E £(zU) be a factor such that 
\u\ > 6 . As \u\ > 6 |S|, every occurrence of u in T must S 5 mchronize to the factorization of T as a 
product of S and L. It follows that every return to m in T is a product of S and L. Since the return 
time of u is finite in T, the return time of the word u in a; is also finite. Hence fl is minimal. □ 

We also prove the following weaker result, which we need later. 

Lemma 8.13. The position |S| — [Sel ofS is repetitive. 

Proof. We prove first by induction that the prefix of the word S(,'s^f of length 2|?/t/| ~ jSel is a 
product of minimal squares for k > 2 and i such that 0 < t < fl/f. Let us first establish the base 
cases. 

Recall that §2 = 0(10“) and S 3 j = S5. We have that 
Sgsf = 10“+^(10“)‘’+^(0(10“)‘’+^)2 = 5^10“+^ (10“)^’+^ = SgSe. 

In addition, for 0 < ^ < 03 , we have that 

= S6S3,iS2"^?3,£ = Sls2~%,e = Sls2~\s2. 

The case £ = 1 is clear. So let us assume that £ > 1. We have that 
Ses^f = Sls^~%s^~\s^ Se, 

so it is sufficient to show that the word s'|“^Sis'|“^SoSj’ is in n(o, b). 

Suppose first that £ — 1 is even. Then as S 2 is a solution to ( 8 ), it is enough to show that 
sis' 2 ~^So^i’ G n(a, b). Since si ?2 = L(? 2 )si, we have that 

SiS^~%S^ = L{s2y~%soS^. 

Now SiSoSj’ = L(s 2 ). The word L(? 2 ) is a solution to ( 8 ), so the conclusion follows as £ — 1 is 
even. 

Suppose next that £ — 1 is odd. We need to show that S 2 Sis|“^SoSi’ G n(a, b). Using the facts 
?i ?2 = L(? 2 )?i and ?i?os' 3 ’ = L(s 2 ) we obtain that 

? 2 SiS 2 "^SoSi’ =S 2 L(s 2 )^"^. 

By Lemma 5.6 the word § 21 ( 52 ) is a product of minimal squares. Since £ — 1 is odd and L(? 2 ) is a 
solution to (8), the conclusion follows. 

We have established the base cases. Now for A: > 4 and 0 < i < aj^, we have that 

= S 6 (Sfc_ 2 §'/_i)^. 

By induction Sfs };-2 = Nj ■ ■ ■ X^Se and SeSjt-i = for some minimal square roots 

Xi,..., X„, Yi,..., Ym- Therefore 

S,sl, = {Xl---Xl{Yl--YlYfS(,. 
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We have thus proved that the prefix of fhe word of lengfh 2|?i- £| — iSel is a producf of 

minimal squares for k >2 and £ such fhaf Q < t < a^. 

Now if S = for some k > 2 and £ such fhat 0 < £ < fhen fhe claim is clear by fhe 
above. Suppose fhaf S = Now if Ss^k,t i n(a, b), fhen fwo applications of Lemma 5.5 

(firsf wifh ii = S(„v = and fhen wifh u = S(,L, v = S) show fhaf fhe claim holds. Assume fhaf 
^ 6 %/ G n(a, b). Since fhe prefix of S^k,l of lengfh 2|sjt^^| — jSel is in n(a, b), fhis means fhaf fhe 
prefix of of lengfh \sk/ \ — | Se | is in LI (a, b). If is sufficienf fo show fhaf fhe prefixes of Sk/ and 
L(sk^i) of lengfh 2 |? 2 | are in n(o, b). Since ?iS 2 = k(s 2 )si, fhe word = ? 2?3 has S 2 L(s 2 ) as a 
prefix. If fl 3 > 1, fhen fhe word ?3 = has L(s 2 )s 2 as a prefix. Finally if a^, = 1, fhen fhe word 
? 5 ^i = ? 3?4 = S 1 S 2 S 4 has L(s 2 )^ as a prefix. Lemma 5.6 shows fhaf J 2 k(s 2 ), L(s 2 )s 2 / and £($ 2 )^ are 
all in n(fl, h). The conclusion follows. □ 

There is no clear pattern for ofher positions in 3$', if depends on the word S if a position in 
Bs is repefifive or nof. The posifion | S | — | Sg | is nof always nicely repetitive. Suppose fhaf a = 1, 
b = 0, and S = §33 = 10(010)^. Then fhe factor beginning af posifion |S| — IS 5 I = 6 of of 
lengfh |S^| is a pro'ducf of minimal squares: (10010)^ ■ (010)^ ■ (100)^. As | (10010)^ ■ (010)^| = 
16 = II — 6 , fhe posifion 6 is nof nicely repefifive. 

Since none of fhe minimal squares can be a proper prefix of anofher minimal square, if is 
easy fo facforize words as producfs of minimal squares from left fo righf. Next we consider what 
happens if we sfarf fo backfrack from a given posifion fo fhe left. 

Lemma 8.14 (Backtracking Lemma). Let X, Yi, ■ ■ ■ Yn be minimal square roots. Let wbea word having 
both of the words and ■■ - as suffixes. If \X\ > |Y„|, then |X| > [Yj ■ ■ ■ Y„| and the word 
Yi ■ ■ ■ Y„ is a suffix of X. 

Proof. Suppose that |X| > \Yn\. We may assume that n is as large as possible. We prove the 
lemma by considering different options for fhe word X. 

Clearly we carmof have fhaf X = Si. Lef X = S 4 . Now X^ can have a proper minimal square 
suffix only if a > 1. If o is even, fhen we musf have fhaf 

x2 = 10 “l(S ?)“/2 and Y„_„/ 2 +i = ... = Y„ = Si. 

The suffix (Si)“^^ of w carmof be preceded by S 2 as ofherwise w would have S 2 S“ = 010^““^ 
as a suffix; fhis is nof possible as 2a — 1 > a. Therefore fhere is no choice for Y„_a/ 2 . Thus 
I I < I I Yi ■ ■ ■ Y„ is a suffix of X. If a is odd, fhen similarly 

x2 = 10 “ 10 (Sf)(“-i )/2 Y„_(„_i)/ 2 +i = ... = Y„ = Si. 

Again fhere is no choice for yn-{a-i)/ 2 ' conclusion holds. Similar considerations show 

thaf fhe conclusion holds if X € { S 2 , S 3 }. 

Lef fhen X = S 5 . If is obvious fhaf now Y„ e {Si,S 3 , S 4 }. If Y„ = Si or b = 0, fhen like 
above Yi = ... = Y„ = Si and Yi ■ ■ ■ Y„ is a suffix of X. We may fhus suppose fhaf b > 0. Say 
Y„ = S 3 . Then we musf have b = 1 and X^ = 10““'"^10““^Y^. Like above, fhe remaining minimal 
square roofs Y, wifh i < n musf equal fo Si and fhere musf be [(a — 1)/2J of fhem. Since fhere 
is no furfher choice, fhe conclusion holds as clearly Yi ■ ■ ■ Y„ is a suffix of X. Suppose fhen fhaf 
b > 1. The nexf case is Y„ = S 4 . Assume firsf fhaf b is even. Then if is sfraighfforward to see that 
necessarily 

yn-b/ 2+1 = ... = Yn = Si and X^ = 10“+i(10“)^10“+i(S|)^/2. 

Thus Y „_{,/2 = Si and, further, it must be that 

Y„_b = ... = Y„_b/ 2 _i = S 2 and X^ = 10 “+il 0 “-HS^)‘’/ 2 s 2 (s 2 )t>/ 2 , 
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Like before, the remaining minimal squares Y, with i < n — b must equal to Si and there must be 
[(a — 1)/2J of them. Therefore 

Y^...Yn = 

is a suffix of X, so the conclusion holds. If b is odd, then in a similar fashion 

so yn-{b-l )/2 — ^rid 

''^Ji-(b-l)/ 2 +l = • • • = Y„ = S 4 and Y„_(,+l = . . . = Yn-ib-l)/!-! = ^ 2 - 
Again, the final [(a—1)/2J minimal square roots must equal Si. Since 

Yi---Yn = 

is a suffix of X, the conclusion holds. 

If X = Se, then it is clear that Y„ 7 ^ S 5 . The conclusion follows as in the case X = S 5 . □ 

The next lemma is useful in the proof of Theorem 8.7. 

Lemma 8.15. Let w be an infinite product of the words S and L and t*!, £ 2 , £3 be positions ofw such that 
£\< £i< £ 3 - Let r be the largest integer such that £i > r|S|. If 

• w[£i, £3 - 1 ],w[£ 2 , £3 - 1] e TI(a, b), 

• £\ — r|S| e Bs, and 

• h < (^ + 1)|S|, 

then for all u e TI(o, b) such that uw[£ 2,£3 — 1] is a suffix ofiv[0 ,13 — 1] we have that ^3 — 1]| < 

\w[£i,l 3 -1 ]|. 

Proof. Let u = ^[^ 1,^3 — 1] and u = w[£ 2,£3 — 1]. Since v,u e n(a, b), we may write v = 
and u = Y^ ■ ■ ■ Y^ for some minimal square roots X, and Y,-. If n > m and = Y, for all 

i G {1,..., m}, then as |u| > \u\, we must have that n > m. This means that the prefix X^ of 
V ends before the position £ 2 , that is, t'l + |X^| < £2 < (r + 1)|S|. This contradicts the fact that 
^1 — r|S| G 8 $. Therefore as |u| > \u\, we we conclude that there exists maximal j G {1,..., m} 
such that X„_^_|_y 7 ^ Yj. If |Yy| > |X„_,„+y|, then by the Backtracking Lemma we have that 
|Xj ■ ■ ■ < |Y?|. This is not possible as Iu| > \u\. Therefore |Yy| < \X„_m+j\- Let z G 

n(a, b) be such that zu is a suffix of w[0, Z 3 — 1]. Write z = ■ ■ ■ Zf for minimal square roots 

Z,. Applying the Backtracking Lemma to the words and Zj ■ ■ ■ Z^Y^ '' 

|Z2 ■ ■ ■ Z}Yl \Xl_^^^.\. It follows that |zu| < |i;|. □ 

Finally we can give a proof of Theorem 8.7. 

Proof of Theorem 8.7. Let w (E Cl. Since T is uniformly recurrent and a product of the words S and 
L, there exists a word re' G O such that zv' is a product of S and L and w = T^{w') for some £ 
such that 0 < £ < | S | (recall that a product of S and L occurring in T having length at least 6 1S | 
must S 5 mchronize to the factorization of T as a product of S and L). If £ = 0, then the conclusion 
holds by Lemma 8 . 8 , so we can assume that £ > 0. Write lu as a product of minimal squares: 
w = XfX^ ■ ■ ■. Let 

ri =max{{0}U{iG {1,2,...}: |X{---X2| < |S|-£}}. 
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If f"! > 0, then set = £ + |Xj ■ ■ ■ | ■ If J'l =0, then we set ii = t. By the maximality of r\ and 

by the definition of the set Sg, it follows that i\ e Sg U {| S |} (indeed, the word L also has S 5 as a 
suffix). See Figure 5. 

To aid comprehension we have separated different parts of the proof as distinct claims with 
their own proofs. Any new definitions and assumptions given in one of the subproofs are valid 
only up to the end of the subproof. 

Claim 8.15.1. Iji\ = |S|, then -Jw e O. 

Proof. Suppose that = |S|. By the definition of the number ri, we have that rj > 0 and the 
word = X 2 ^2 ''' is a product of the words S and L. Now zw' e O 

where z e {S, L}. Since zw' is a product of S and L, by Lemma 8.8 \/zk 7 e fl. By the choice of 
S as a solution to ( 8 ) and by Lemma 5.6, the first |S^| letters of zw' can be written as a product of 
minimal squares. Hence zw' = Yj ■ ■ ■ some minimal square roots Yi,..., Y„. 

By the Backtracking Lemma, we have that Xi ■ ■ ■ X^j is a suffix of Y^ ■ ■ ■ Y„. Thus the word ^/w = 
Xi ■ ■ ■ X,-jXrj+i ■ ■ ■ is a suffix of the word y/zw' = Yi ■ ■ ■ Y„XrjX,.j+i ■ ■ ■. Therefore £{^/w) C 
£(Vz^) =£(r), so y/w e n. □ 

We assume that £-[ E 3$. Now either the position of S is nicely repetitive or it is not. 

Claim 8.15.2. Ifi^ is a nicely repetitive position ofS, then yfw is periodic with minimal period conjugate 
to S. 

Proof. By Lemma 8.10 the word y/T^^w') is periodic with minimal period z conjugate to S. If 
= I, then there is nothing more to prove, so assume that l\ 7 ^ 1. There exists u,v E {S, L} such 
that uvw' E n. Since £1 is a nicely repetitive position of S, the prefix of (uvw') of length | | is a 

product of minimal squares and its square root equals z by Lemma 8.10. Since the factor w' [£, £i — 
1] is also a product of minimal squares, the Backtracking Lemma implies that yf w'[£, £i — 1] is a 
suffix of z. Now y/w = y/w'[£,£i — 1] y/T^'L{w'), SO y/w is periodic with minimal period conjugate 
to S. □ 

If the position £■[ of S is not nicely repetitive, then either it is not repetitive or it is repetitive 
but not nicely repetitive. 

Claim 8.15.3. If £i is repetitive but not nicely repetitive position of S, then y/w E O. 

Proof. Suppose that £1 is a repetitive but not a nicely repetitive position of S. This means that 
either S^[£i, |S| — 1] G n(a, b) or S^[£i, |S^| — 1] G n(a, b) (they both carmot be in n(a, b) as this 
would imply that S is not primitive). Thus either w'[£i, |S| — 1] G n(a, b) or w'[£i, |S^| — 1] G 
n(a, b) (in the latter case Lemma 5.5 ensures that w'[£\, |S^| — 1] G n(a, b)). The former case is, 
however, not possible as it would contradict the maximality of rj. Thus only the latter option is 
possible. Since w' is a product of the words S and L, the prefix w'[Q, |S^| — 1] of w' is a product of 
minimal squares. Since w'\£,£i — l],w'[£i, |S^| — 1] G n(a,b), the Backtracking Lemma implies 
that y/w'[£, |S^| — 1] is a suffix of y/w'[0, \S^\ — 1]. Thus y/w is a suffix of y/u/. As \/u/ G O by 
Lemma 8 . 8 , we conclude that y/w E Cl. □ 

Now we may suppose that £1 is not a repetitive position of S. We let 

r 2 =max{i G {r^ + l,ri +2,...}: |Xj ■ ■ ■ Xj\ < |S^| - £}, 

rg = max{i G {r 2 + 1, r 2 + 2,...}: \X^ ■ ■ ■ Xf\ < |S^| — £}, and 

r 4 = max{i G {rg + l,rg + 2,...}: jX^ ■ ■ ■ Xf\ < |S^| - £}. 
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The numbers r^, r^, and are well-defined as the words S and L are not minimal squares. We set 
i2 = il + |X2+1 
^3 = h + \^r2+l 
h = £3 + |X2+1 

Intuitively, the positions ii, £3, and £4 are the successive positions of w which are closest from 
the left to the boundaries of the words S and L in the factorization of w' as a product of the words 
S and L such that the prefix up to the position is a product of minimal squares; see Figure 5 . Let 
gi = £\, g2 = £2 ~ |S|, g3 = £3 — |S^|, and ^4 = ^4 — |S^|. It is clear by the definitions that 
gi e BU{|S|} for alii e { 1 , 2 , 3 , 4 }. 

Claim 8 . 15 . 4 . We have that gi,g3 7^ |S|. If g2 or ^4 equals |S|, then -Jw e fl. 

Proof. By our assumption that £\ e Bs, we have that gi 7^ |S|. If g2 = |S|, then the factor 
w'\£\, |S^| — 1 ] would be a product of minimal squares. This case was already considered in 
Claim 8 . 15.3 where we concluded that \fw e O. 

Suppose that g3 = |S|. Consider the positions £3 and |S| of zv'. Both of the factors u = 
w'[£\,£3 — 1 ] and v = w'[|S|,^3 — 1 ] = w'[£3 — |S^|,^3 — 1 ] are in n(a, b). Now zw' e O for some 
z e {S, L}. Since SS, SL, LS, LL e n(o, b), the prefix of zie' of length |S^| is inn(a, b). Lemma 8.15 
applied to the word (zw')[ 0 , |S|-|-^3 — 1 ] implies that |(za;')[ 0 , |S|-F Z3 — 1 ]| < \u\ < |S^|whichis 
nonsense. Therefore ^3 7^ |S|. 

Assume then that g^ = |S|. Suppose for a contradiction that g 2 7 ^ gi- Both of the factors 
u' = w'[t' 2,^4 — 1] and v' = w'[\S \'^,£4 — 1] = w '\£/3 — |S ^|,^4 — 1] are in n(o, b). Since g 2 7 ^ gi, 
also £2 7 ^ |S^|. Thus by the definition of £ 2 , we have that £2 < IS 2 I. Lemma 8.15 applied to the 
word w'[0,£i — 1] shows that |w'[0, ^4 — 1]| < \u'\ < |S^| which is absurd. This contradiction 
shows that ^2 = g't = |S|, so 3 /w e fl. □ 

We may now assume that g, e 3 $ for all i e { 1 , 2 , 3 , 4 }. 

Claim 8 . 15 . 5 . The position g2 of S is nicely repetitive. 

Proof. Assume on the contrary that neither of the positions g2 and g3 is a repetitive position of 
S. First note that as gi is not repetitive, we have that g3 gi. Similarly g2 gi. If gi = g2, 
then it follows from Lemma 5.5 and the definitions of the positions I2 and I3 that g2 = gs; a 
contradiction. Hence gi 7^ g2. Similarly g2 7^ g3 as otherwise the position g2 would be repetitive. 
Finally, g3 7^ g4 because g3 is not repetitive. We have two cases: either gi = g4 or gi 7F g4. 

Assume that g4 7F g4. By Lemma 8.13 the position |S| — [Sel of S is repetitive, so gi,g2,g3 G 

\ {|S| - iSgl} = {|S| - |Si|,|S| - |S 3 |,|S| - IS 4 I}. Since all of the positions gi, g 2 , and g 3 
are distinct, the only option is that g 4 = |S| — IS 5 I. Since the position |S| — jSel is repetitive, by 
Lemma 5.5 the factor u = w'[£i — |S ^|,£4 — 1] is in n(a, b). By the definition of the positions £ 2 , 
£ 3 , and £i also v = zv'[£ 2 ,£i — 1] G n(o, b). Since g 2 7 ^ g 4 , also £2 £i— |S^|. Since |S| — jSel 

is the smallest element of the set 65 , we have that £2 > £i — |S^|. As zv[li, Z 2 — 1] G n(a, b), we 
obtain by Lemma 8.15 that \zv[li, Z 4 — 1] | < \u\ = |S^|. This is a contradiction. 

Hence we have that gi = g4. Since the factor w\£\, £2 — 1 ] is a product of minimal squares, the 
number C\= £2 — £\ is even. Similarly the numbers C2 = £3 — £2 arid C3 = £4 — £3 are even. Thus 
the number ci -|- C2 -F C3 = 3 |S| is even, so |S| is even. It follows that the numbers = g2 — gi, 
^2 = g3 “ g2/ arid ^3 = g4 — g3 = gi — g3 are even. However, exactly two of the numbers |Si |, 
IS3I, and IS4I have odd length. Hence exactly two of the numbers gi, g2, and g3 are odd. Thus it 
is not possible that all of the numbers di, d2, and ^3 are even. This is a contradiction. 

The previous contradiction shows that either of the positions g2 and g3 is a repetitive position 
of S. Suppose for a contradiction that g3 is repetitive. We have that zv'\£\,£3 — 1 ] G n(a, b) 
and ii;'[^3 — |S^|,^3 — 1 ] G n(a, b). Similar to the second paragraph of this subproof, using 


I 

^1-2 I' 

■ ■ ■ x^j, and 
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S/L S/L S/L S/L 

Figure 5: The positions I, l\, t? 2 , ^ 3 , and of w' and the minimal squares between the positions. 


Lemma 8.15 we obtain a contradiction unless = ^ 3 . Even this conclusion is contradic¬ 
tory as gi is not repetitive. Therefore g 3 can not be repetitive, so g 2 is a repetitive position 
of S. Now if g 2 would not be nicely repetitive, we would have by the maximality of r 2 that 
w'[i2, |S^| — 1] e n(o, b), that is, ^3 = |S|. However, since g^ e Bs, we have that g2 is a nicely 
repetitive position of S. □ 

We are now in the final stage of the proof. We will show that is periodic with minimal 
period conjugate to | S |. 

We can now argue as in the proof of Claim 8.15.2. Since g 2 is a nicely repetitive position of S, 
by Lemma 8.10 the word is periodic with minimal period z conjugate to S. We have that 

uw' e n for some u e {S, L}. Since g 2 is a nicely repetitive position of S, the prefix of TS^{uw') 
of length |S^| is a product of minimal squares and its square root equals z by Lemma 8.10. Since 
w'[(., 1^2 “ 1] € n(Q/ &)/ the Backtracking Lemma implies that \Jw'\i,I 2 ~ 1] is a suffix of z. Now 
y/w = y/w'[i,l2 — so y/w is poriodic with minimal period conjugate to S. 

By Lemma 8.11 the word S always has at least one nicely repetitive position. It therefore 
follows that there exists a word in fl having periodic square root. □ 


9 Remarks on Generalizations 

It is natural to think that the square root map could be generalized to obtain a cube root map and, 
further, a root map. However, in [16, Theorem 5.3.] Saari proves the following reformulation 
of a result of Mignosi, Restivo, and Salemi. 

Proposition 9 . 1 . If w is an everywhere oc-repetitive word with a > cp + 1, where cp is the golden mean, 
then w is ultimately periodic. 

Generalizing the square root map to a cube root map would require everywhere 3-repetitive 
words. By the above such words must be ultimately periodic, so we expect that this direction of 
research would not be fruitful. 

Another way to generalize the square root map is to use abelian powers instead of ordinary 
powers. For abelian powers a result like Proposition 9.1 does not exist. For instance, by [15, 
Theorem 1.9.] every position in a Sturmian word begins with an abelian power for all k > 
2. Abelian square root can be defined for e.g. optimal squareful words as we will see shortly. 
However, abelian cubes in Sturmian words do not work. Consider again the Fibonacci word /. 
The minimal abelian cube prefix of T(/) is 10 ■ 01 ■ 01. This abelian cube is followed by the factor 
00, so the root of the next abelian cube must begin with 00. Hence if we define the abelian cube 
root of T (/) to be the product of the roots of the abelian cubes, the resulting word begins with 
1000 which is not a factor of /. Thus by defining an abelian cube root map in this way, we lose 
the main property that the mapping preserves the languages of Sturmian words. 

In [17] Saari also considers optimal abelian squareful words. Optimal abelian squareful words 
are defined by replacing minimal squares with minimal abelian squares in the definition of op¬ 
timal squareful words. Let w = X 1 X 2 X 2 X 2 ■ ■ ■ be a product of minimal abelian squares X,X'. 
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We define its abelian square root as the word ‘iyw = X 1 X 2 ■ ■ ■. It follows from [17, Theorem 18] 
that the six minimal squares are products of exactly five minimal abelian squares (this is straight¬ 
forward to verify directly). Thus if w is an optimal squareful word, then = ‘i^w. Thus by 
Theorem 3.2 the abelian square root of a Sturmian word Sx,^ is the Sturmian word Also, 

by Theorem 8.7 there exists a minimal subshift O such that for all ic € O either G O or 
ij/a; is periodic. Saari proves in [17, Theorem 19] that an optimal abelian squareful word must 
have at least five distinct minimal abelian squares, but he leaves the characterization of these sets 
of minimal abelian squares open. Thus it is possible that there exists optimal abelian squareful 
words which contain other minimal abelian squares than those given by [17, Theorem 18]. For 
such words the abelian square root map could exhibit different behavior than the square root 
map (if the square root map is even defined for such words). We have not extended our research 
to this direction. 

We could also generalize the special function ip. Divide the distance D between x and 1 — a 
into k parts and choose the image of x to be x -|-1D among the points 

X -|- yD, X yD ,. . ., X H- - -D 

k k k 

to obtain the function 

1 

^k,t : T —t T,x !->• -{tx + {k — t){l — a)). 

K 

The map is a perfectly nice function on the circle T, but to make things interesting we would 
need to find a symbolic interpretation for it. We have not figured out any such interpretation for 
these generalized functions. 

10 Open Problems 

In the Section 8 we saw that there are non-Sturmian words whose language is preserved un¬ 
der the square root map. However, Sturmian words satisfy an even stronger property: by 
Theorem 3.2 for the Sturmian subshift of slope a it holds that x/Clx C fla. This property 
is not satisfied by the aperiodic and minimal subshift Or of the word T constructed in Section 8 
since by Theorem 8.7 there is a word in Op having periodic square root; since Op is aperiodic and 
minimal, it cannot contain such words. We are thus led to ask the following question we could 
not answer: 

Question. If Cl is a subshift containing optimal squareful words satisfying \/n C O, does the subshift 
Q only contain Sturmian words? 

Let us briefly see that if we do not require all words in O to be aperiodic then the above 
question has a negative answer. 

Proposition 10.1. There exists a non-minimal non-Sturmian subshift Cl containing squareful words such 
that -v/n C n. 

Proof Sketch. Let S be a seed solution as in Section 8, and let T be a corresponding fixed point 
of the square root map generated by the seed S as in Section 8. Further, set A = S'^, let 
be the subshift generated by A, and let Op be the subshift generated by F. If w Cz Op, then by 
Theorem 8.7 either G Qp or y/w G n^. Hence if we are able to show that y/Cfy C Q^, then 
the non-minimal and non-Sturmian subshift flp U Cl/y has the desired properties. 

Let w G Ha, so w = fy(A) for some 0 < t < |S|. Write a; as a product of minimal squares: 
w = X^X 2 ■ ■ ■. We can now argue as in the proof of Theorem 8.7. If jX^ ■ ■ ■ X^| = |S| — t for 
some n > 1 or |X^ ■ ■ ■ X^| = \S'^\ — (. for some m > 1, then using the fact that -v/A = A it is 
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straightforward to see that \/w G O^. Otherwise either ^ is a nicely repetitive position of S or 
is a nicely repefifive posifion of S where 

i = max {7 e {1,2,...}: |X{ ■ ■ ■ Xj\ < |S^| — 1}. 

In bofh of fhese cases we deduce wifh fhe help of Lemma 8.10 fhaf -Jw G 0^- □ 

There are ofher interesfing relafed questions. Consider fhe limif sef 

n n Vn n n.... 

We know very liffle abouf fhe limif sef excepf in fhe Sfurmian case when if confains fhe two fixed 
poinfs OlCa and lOca. For fhe word F of Secfion 8 we proved fhaf fhe limit set contains at least 
two fixed poinfs. We ask: 

Question. When is the limit set nonempty? If it is nonempty, does it always contain fixed points? Can it 
contain points which are not fixed points? 

It is a genuine possibility that the limit set is empty. Consider for insfance fhe word f = 
T{cr‘^{6)), fhe morphic image of fhe fixed poinf of fhe morphism cr : 6 i—)• 656556,5 i—)• 5 under 
T : 6 I—>• Sg, 5 I—>• Sg where S 5 = 100 and Se = 10010 are minimal square roofs of slope a = 
[0; 2,1,...]. It is straightforward fo verify that f is optimal squareful and uniformly recurrenf and 
fhaf fhe refurns fo fhe factor 101 in are 10100,101(001)^00 and 101(001)^00. By considering 
all possible occurrences of fhe factor w = t(56565) G jC.{Q in any producf of minimal squares of 
slope a, if can be shown fhaf fhe square roof of fhe producf always confains a refurn fo fhe factor 
101 which is nof in £(^). Since fhe facfor w occurs in every poinf in fhe subshiff generated by 
f, we conclude fhaf n ynf = 0. 

In Secfion 8 we consfrucfed infinite families of primifive solutions fo ( 8 ) using fhe recur¬ 
rence Jk+i = ^{7k)7k' consfrucfion worked was because fhe seed solufion S and 

fhe word L = L(S) satisfy = S, ^/^ = S, \/LS = L, and ^/LL = L, fhaf is, 

a/(LSS)^ = VLS ■ SL ■ SS = LSS. Similarly a/(SLLLL)^ = SLLLL, so subsfifufing for exam¬ 
ple S = 01010010 we obtain the primitive solution 

S 2 S 1 S 4 S 3 S 5 S 4 S 3 S 5 S 6 S 5 S 4 S 3 S 5 S 4 S 3 =0101001010010010100100101001001010010010 

to ( 8 ) in £(1,0). More solutions can be obtained with analogous constructions. Restricting to the 
languages of optimal squareful words, we ask: 

Question. What are the primitive solutions w of ( 8 ) in L{a, b) such that w or w^ is not Sturmian and w 
is not obtainable by the above construction? 
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